3 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



Mill 

© Publication number: 



0 644 267 A2 



© 



EUROPEAN PATENT APPLICATION 



© Application number: 94111298.9 
@ Date of filing: 20.07.94 



© int. CI. 6 -. C12Q 1/02, C12N 9/02, 
C12N 15/53, C12N 15/81, 
//C12Q1/26 



@ Priority: 20.07.93 JP 201120/93 
21.07.93 JP 180246/93 
30.07.93 JP 208279/93 

@ Date of publication of application: 
22.03.95 Bulletin 95/12 

® Designated Contracting States: 
CH DE FR GB LI 

© Applicant: SUMITOMO CHEMICAL COMPANY, 
LIMITED 

Kitahama 4-chome 5-33 

Chuo-ku 

Osaka 541 (JP) 

@ Inventor: Hayashi, Koji 
14-7, Mefu 2-chome 
Takarazuka-shi, 
Hyogo-ken (JP) 
Inventor: Sakaki, Toshiyuki 
1-30-68, Higashitoyonaka-cho 
Toyonaka-shi, 
Osaka-fu (JP) 



CM 

< 

CO 
CM 

CO 



Inventor: Yabusaki, Yoshiyasu 

6-2-1, Minatojimanakamachi, 

Chuo-ku 

Kobe-shi, 

Hyogo-ken (JP) 

Inventor: Komai, Koichiro 

4-9-17, Sakuragaoka 

Minoo-shi f 

Osaka-fu (JP) 

Inventor: Kaneko, Hideo 

63, Koyamaitakura-cho, 

Kita-ku 

Kyoto-shi, 

Kyoto-fu (JP) 

Inventor: Nakatsuka, Iwao 

3-8-14, Hinomine, 

Kita-ku 

Kobe-shi, 

Hyogo-ken (JP) 

© Representative: VOSSiUS & PARTNER 
Postfach 86 07 67 
D-81634 Munchen (DE) 



© Method lor safety evaluation of chemical compound using recombinant yeast expressing human 
cytochrome P450. 

© There is disclosed a method for evaluation of the safety of a chemical compound, which includes the steps 
of- (a) reacting a chemical compound with recombinant yeast cells expressing, or in other words producing, 
human cytochrome P450 molecular species P450 1A2, P450 2C9, P450 2E1 and P450 3A4 together with a 
yeast NADPH-P450 reductase, which may be in the form of a fused enzyme with each of said human 
cytochrome P450 molecular species, or with the cell free extracts of the yeast cells; and (b) analyzing the 
resulting metabolite to determine the safety of the compound. According to this method, it can be determined 
whether a test compound will be converted into a carcinogenic or mutagenic form through the metabolism in the 
human liver, and whether the test compound or its metabolite has mutagenicity. 
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The present invention relates to a method for evaluation of the safety of a chemical compound using 
recombinant yeasts expressing human cytochrome P450. 

The cytochrome P450 is an enzyme catalyzing the mono-oxygenation of a substance in the human 
liver. 

5 It is known that recombinant human cells expressing heterogeneous human cytochrome P450 species 
have been used for determination of metabolisms and toxicities of chemical substances. However, this 
method is unsatisfactory as a method of evaluation of the safety of chemical compounds partly because the 
kinds of the human cytochrome P450 species expressed by the cells and the levels of the expression are 
so limited that the amount of metabolite obtained is not enough for determination of the metabolism and 

to toxicity, and partly because it requires not only a high density culture technique but a high cultivation cost. 
Accordingly, there has been a great demand for developing an advantageous method. 

As a result of the extensive study, the present inventors have found that yeasts are particularly suitable 
as hosts for production of human cytochrome P450 and yeast NADPH-P450 reductase to be used in in vitro 
determination of metabolisms and toxicities of chemical substances because yeasts grow so rapidly and 

15 can stably express both the human cytochrome P450 and yeast NADPH-P450 reductase at high expression 
levels to provide sufficient amounts of the metabolites in a short period of time, thereby enabling a precise 
and quick analysis of the metabolites. 

Moreover, they have also found that, despite that there are a considerable number of human 
cytochrome P450 molecular species, the human metabolic system for chemical compounds can be 

20 reproduced in vitro when at least four human cytochrome P450 molecular species, i.e., human cytochrome 
P450 1A2, P450 2C9, P450 2E1 and P450 3A4, are combined. 

Thus, the present invention provides a method for evaluation of the safety of a chemical compound, 
which comprises the steps of: 

(a) reacting a chemical compound with recombinant yeast cells expressing, or in other words producing, 
25 human cytochrome P450 molecular species P450 1 A2, P450 2C9, P450 2E1 and P450 3A4 together with 

a yeast NADPH-P450 reductase, which may be in the form of a fused enzyme with each of said human 
cytochrome P450 molecular species, or with the cell free extracts of the yeast cells; and 

(b) analyzing the resulting metabolite to determine the safety of the compound. 

The present invention further provides a method for determination of the human metabolite of a 
30 chemical compound, which comprises the steps of: 

(a) reacting a chemical compound with recombinant yeast cells producing human cytochrome P450 
molecular species P450 1A2, P450 2C9, P450 2E1 and P450 3A4 together with a yeast NADPH-P450 
reductase, which may be in the form of a fused enzyme with each of said human cytochrome P450 
molecular species, or with cell free extracts of the yeast cells; and 
35 (b) identifying the resulting metabolite. 

Figs. 1 to 4 show various primers for cloning human P450 genes. 
Fig. 5 shows a synthetic linker for human P450 gene cloning. 

Fig. 6 shows a method of constructing yeast expression plasmids for human P450 1A2. 

Fig. 7 shows a method of constructing yeast expression plasmids for human P450 2C9. 
40 Fig. 8 shows a method of constructing yeast expression plasmids for human P450 2E1. 

Fig. 9 shows a method of constructing yeast expression plasmids for human P450 3A4. 

Fig. 10 shows a method of constructing yeast expression plasmids for human P450 1A1. 

Fig. 11 shows a method of constructing yeast expression plasmids for human P450 2A6. 

Fig. 12 shows a method of constructing yeast expression plasmids for human P450 2B6. 
45 Fig. 13 shows a method of constructing yeast expression plasmids for human P450 2C8. 

Fig. 14 shows a method of constructing yeast expression plasmids for human P450 2C18. 

Fig. 15 shows a method of constructing yeast expression plasmids for human P450 2C19. 

Fig. 16 shows a method of constructing yeast expression plasmids for human P450 2D6. 

Fig. 17 shows a method of constructing a yeast expression plasmid containing an artificial fused 
so enzyme gene. 

Fig. 18 shows a method of constructing a yeast expression plasmid using a GAPDH promoter. 

According to the present invention, it can be determined whether a test compound will be converted 
into a carcinogenic or mutagenic form through the metabolism in the human liver, and whether the test 
compound or its metabolite has mutagenicity. *' 
55 Thus, the present invention provides a method for evaluation of safety of a chemical compound, and a 
method for determination of the human metabolite of a chemical compound. 
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Human Cytochrome P450 and Their Genes 

The yeasts capable of expressing, or producing, said enzymes can be obtained by transforming them 
with expression plasmids containing genes encoding said enzymes with a conventional recombinant DNA 
5 method. 

The human P450 molecular species to be used in the present invention include at least four human 
cytochrome P450 molecular species, i.e., human cytochrome P450 1A2, P450 2C9, P450 2E1 and P450 
3A4. The genes encoding these essential human cytochrome P450 molecular species and yeast NADPH- 
P450 reductase are reported in Nucleic Acids Res., 14, 6773-6774, 1986; J. Biochem., 102, 1075-1082, 
jo 1987; J. Biol. Chem., 261, 16689-16697, 1986; DNA, 7, 79-86, 1988; and J. Biochem., 103, 1004-1010, 
1.988. 

Although the kinds of P450 molecular species present in human liver vary among the race and 
individuals, the combination of said human P450 molecular species includes at least about 85% (molar 
ratio) of the total amount of the human P450 molecular species present in the human liver. Hence, the 

75 present method using the said combination of human P450 molecular species can accurately reproduce the 
human liver metabolism in vitro. 

The combination of these P450 molecular species may optionally be varied, taking into account of the 
amounts of these P450 molecular species in the liver: the amount of P450 3A4 present in the human liver is 
about 35±10% of the iotal amount of the human P450 molecular species; P450 2C9 about 25±10%; P450 

20 1A2 about 23±10%; and P450 2E1 about 17±10%. 

In addition to the above-mentioned combination, human P450 molecular species P450 2A6, P450 2C19 
and/or P450 2D6 (Biochemistry, 29, 1322-1329, 1990; Biochemistry. 30, 3247-3255, 1991; Am. J. Hum. 
Genet.. 45, 889-904, 1989) may also be added. In this case, the combined human P450 molecular species 
covers at least 90% of the total amount of the human P450 molecular species present in the human liver. 

25 The in vitro human metabolic system that reproduces accurately the human metabolism of a chemical 
compound, and can represent the differences among races and individuals can be obtained when these 
human P450 molecular species are properly combined, taking into account of the amounts of these species 
in the liver. 

Furthermore, at least one human cytochrome P450 molecular species selected from the group of P450 
30 1A1. P450 2B6, P450 2C8 and P450 2C18 (Science, 228, 80-83, 1985; Biochemistry, 23, 7340-7348, 1989; 
Nucleic Acids Res., 15, 10053-10054, 1987; Biochemistry, 30, 3247-3255, 1991) may be added to said 
human cytochrome P450 molecular species to reproduce in vitro the metabolism of the human liver more 
accurately. 

The nucleotide sequences coding for the human P450 molecular species are disclosed in SEQ ID NOs: 
35 1 to 38. 

Cloning of Genes 

The genes coding for the human cytochrome P450 molecular species are known and can be obtained 
40 by the conventional cloning methods. 

For example, they may be obtained by: 

(i) preparing a mRNA fraction containing the mRNA of the gene coding for human cytochrome P450 
molecular species; 

(ii) preparing a cDNA from the mRNA fraction using reverse transcriptase; 

45 (iit) preparing a cDNA library by inserting said cDNA into a pharge vector or a plasmid vector; and 

(iv) cloning the gene coding for the human cytochrome P450 molecular species from the cDNA library 
obtained above or from a commercially available human liver-derived cDNA library using a DNA 
fragment having an identical sequence to some part of the desired gene or an antibody reactive to the 
protein produced by the gene, 
so The gene may also be obtained from the above-described cDNA library by the PCR method. 

The gene coding for yeast NADPH-P450 reductase may be obtained by the same method as used for 
cloning of the genes coding for human P450 molecular species. More specifically, the gene may be 
obtained by such a known method as described in the Japanese Patent Laid-open Publication No. 62- 
19085. *" 

55 
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Construction of Yeast Expression Plasmids 

The yeasts capable of expressing said enzymes can be obtained by transforming them with expression 
plasmids containing genes encoding said enzymes with a conventional recombinant DNA method. 
5 The yeast expression plasmid having a gene coding for human P450 molecular species and a gene 
coding for the yeast NADPH-P450 reductase can be constructed by using a conventional recombinant DNA 
method. 

As to the promoter to be used for construction of the expression plasmids for the yeast of the present 
invention, there is no particular restriction so long as the promoter can be used in usual expression systems 
10 for yeasts, and a promoter of a yeast alcohol dehydrogenase gene (hereinafter referred to as ADH 
promoter), glyceratdehyde-3-phosphate dehydrogenase promoter (hereinafter referred to as GAPDH pro- 
moter), and phosphoglycerate kinase (hereinafter referred to as PGK promoter) are preferably used in the 
present invention. 

The ADH promoter can be prepared by a usual genetic engineering method, for example, from a yeast 
is expression vector pAAH5 possessing a yeast ADH1 promoter and terminator ("Methods in Enzymology" by 
Ammerer et al., vol.101, pp.1 92-201). The yeast ADH1 promoter is described in the U.S. Patent No. 299,733 
to Washington Research Foundation and it requires patent license from the patentee in a case of using the 
same for an industrial or commercial purpose. 

The yeast expression plasmid having both a gene coding for human P450 molecular species and a 
20 gene coding for the yeast NADPH-P450 reductase can be constructed by, for example, inserting an Notl 
fragment prepared from yeast expression vector pAAHSN possessing the ADH promoter and terminator 
(Japanese Patent Laid-open Publication No. 2-21 1880) to an Noti site of plasmid pARRN possessing a gene 
coding for yeast NADPH-P450 reductase (Japanese Patent Laid-open Publication No. 2-211880) and then 
inserting cDNA coding for the human P450 molecular species to the Hindlll site of the thus obtained 
25 plasmid pAHRR. Moreover, a vector obtained by exchanging a Hind III site of pAAH5N with other restriction 
enzyme site may be used for the same purpose. 

In the present invention a gene coding for an artificial fused enzyme comprising human .cytochrome 
P450 molecular species and yeast NADPH-P450 reductase can also be used. The artificial fused enzyme 
can catalyze mono-oxygenation reaction and the efficiency of the electron transfer from NADPH is so 
so improved that the activity of the mono-oxygenation reaction is much enhanced. Accordingly, a great amount 
of metabolic products can be obtained in a shorter period of time, enabling accurate analysis. 

The fused gene comprises a gene coding for the human cytochrome P450 molecule on the 5'-terminal 
and a gene coding for the yeast NADPH-P450 reductase on 3'-terminal. 

The gene coding for such an artificial fused enzyme can be constructed by ligating a gene coding for a 
35 human cytochrome P450 species and a gene coding for yeast NADPH-P450 reductase by a conventional 
recombinant DNA method, and the constructed gene is usually inserted to the Hind III site of the yeast 
expression vector pAAHSN having ADH promoter and ADH terminator described in the Japanese Patent 
Laid-open Publication No. 2-211880, 

40 Transformation of Yeast 

The yeast cells expressing the human P450 molecular species and yeast NADPH-P450 reductase or 
yeast cells expressing an artificial fused enzyme comprising human P450 molecular species and NADPH- 
P450 reductase can be obtained by introducing the thus constructed yeast expression plasmid into a yeast 
45 by a known method such as a protoplast method or a method using alkaline metal salt (LiCl). 

In the present invention, two or more expression plasmids may optionally be introduced into a single 
strain of yeast so that the yeast can express two or more molecular species simultaneously. 

As the hosts, Saccharomyces cerevisiae is used in the method of the present invention, in particular, 
Saccharomyces cerevisiae AH22 (ATCC 38626) is preferably used. 

50 

Reaction of Test Compound 

In the method of the present invention, a test compound is reacted with a mixture of at least said four 
human P450 molecular species, or separately with each of the said four human P450 molecular species ift 
55 the presence of the yeast NADPH-P450 reductase. 

Alternatively, it may be first reacted with one or more of the essential human P450 molecular species, 
and then with a' mixture of, or separately with the rest of them; each of the reactions is carried out in the 
presence of the yeast NADPH-P450 reductase. 
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The reaction is carried out by reacting a test compound with the yeast obtained by the transformation 
with an expression plasmid containing a gene encoding a human P450 molecular species and a gene 
encoding yeast NADPH-P450 reductase, or a fused gene encoding a fused enzyme of a human P450 
molecular species and a yeast NADPH-P450 reductase, or with the cell free extracts of the yeast cells. 
5 In the reaction of a test compound with the enzymes of the present invention, living yeast cells and 
their cell free extracts are usually used. 

As the cell free extracts, subcellular fraction of cells containing microsomal fractions, or fractions 
containing both microsome and cytoplasm is used. The cell free extracts or fractions can be prepared, for 
example, by a known method (DNA, Vol.4, No. 3, pp.203-210 (1985)). 
70 However, the present invention can be preferably carried out with the cell free extracts, especially with 
microsomal fractions of the cells. But, when biological analytic method is used to determination of the 
mutagenicity or carcinogenicity, fractions containing microsome and cytoplasm are preferably used. 

The reaction can be conducted by adding a test compound to a culture solution or a buffer solution of 
yeast celts or cell free extracts, and the resultant solution is usually incubated at a temperature, for 
J5 example, at about 10 • C to 40 • C, for about 0.1 to 48 hours. 

The amounts of the yeast cells or cell free extracts and the compound vary depending on the 
conditions such as reaction temperature, reaction time and the kind of the test compound to be used. 

For instance, the amount of the yeast cells to be used in the solution is preferably from about 10 5 to 
about 10 10 per 1 ml 6f the solution , preferably, from about 10 7 to about 10 a per 1 ml of the solution. When 
20 cell free extracts are used, from about 10 10 to about 10 15 of P450 molecules per l ml of the solution, 
preferably from about 10 12 to about 10 13 of P450 molecules per 1 ml of the solution is usually used. 

The amount of the compound to be added is preferably within a range of from about 0.01 umol to 
about 1umo! per 1 ml of the solution. 

The above ranges may be optionally varied, if necessary. 

25 

Determination of Metabolites 

The metabolites present in the reaction solution can then be subjected to elucidation of the chemical 
structures and the measurement of their amounts. The analysis of the chemical structure can be conducted 
30 by known methods ("Guide to Apparatus Analysis (2)", edited by Jiro Shiokawa et al., (revised edition) first 
print, issued from Kagaku Dojin (1985); "Spectral Identification for Organic Compound" by R.M. Silverstein, 
fourth edition, third print, issued from Tokyo Kagaku Dojin (1984)). 

From the results of the analysis of the metabolites, it can be determined whether the tested compound 
will be detoxicated or metabolized into a carcinogen in the human liver when administered. 

35 

Determination of Toxic Effects of Metabolites 

The toxic effects, in particular mutagenicity, of the resulting metabolites can be determined by a 
conventional biological analytic method such as the Ames Test. For example, the metabolites present in the 

40 reaction solution are allowed to react with mutant bacteria such as histidine requiring Salmonella strain 
(Salmonella typhimurium (his-)), or tryptophan requiring Escherichia coil (Escherichia coil (trp-)), and then 
determine whether the metabolites cause the back mutation of the bacteria whether the colonies of revertant 
which is not requiring the amino acid (His + or Trp + ) are formed, and, if formed, how many colonies*. In 
place of the bacteria, mammalian cells such as MCL-5 cells, which are sensitive to cell toxicity of a 

45 chemical compound (U.S. Patent No. 4,532,204), can be used. 

In this method, the compounds that cause the back mutation will be judged to be mutagenicity test- 
positive. 

It is also possible to simultaneously proceed the step (a) of reacting the test compound with the yeast 
cells or the cell free extracts, and the step (b) of analyzing the metabolites present in the reaction solution. 

so The mutagenicity of arylamine derivatives, which are known to be metabolized by the liver into a 
mutagens, can be examined by the biological analytic method. For example, the mutagenicity of 2- 
aminoanthrathene can be detected at the concentration of about 0.1 ug of 2-aminoanthrathene when 20 
pmol of P450 1A2, which is active specifically to 2-aminoanthrathene, is used (Table 1). 

In the present invention, a metabolic probe for a human P450 molecular species can be obtained. *' 

55 If a certain chemical compound is converted by a particular human P450 molecular species into a 
specific metabolite, the amount of such a human P450 molecular species can be determined by detecting 
such a metabolite in excretions such as blood or urine of a living body who has been administered the 
compound, and such a compound is called a metabolic probe. 
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In the present invention, such a metabolic probe can be obtained by screening the metabolites obtained 
by reacting chemical compounds with the yeasts of the present invention. 

Examples 

5 

The present invention will be further illustrated by the following examples, which are not to be construed 
to limit the scope thereof. 

Preparation of cDNA coding for human P450 molecular species 

70 

.cDNA coding for human P450 molecular species were obtained from commercially available human 
liver cDNA library (Clontech Co.) by the PCR method using primers for cloning human P450 genes as 
shown in Figs. 1 to 4, and a method using a synthetic linker for human P450 gene cloning as shown in Fig. 
5. Thus obtained nucleotide sequences for the cDNA and the deduced amino acid sequences are shown in 
75 the sequence listing. 

Relationship between SEQ ID NOs and human P450 molecular species are as follows: 
1. The essential human cytochrome P450 molecular species for the present invention. 



(1) SEQ ID NO: 1 


1A2 


(2) SEQ ID NO: 3 


2C9 


(3) SEQ ID NO: 5 


2E1 


(4) SEQ ID NO: 7 


3A4 



25 2. Auxiliary Human cytochrome P450 molecular species 



(1)SEQ ID NOs: 9, 11 and 13 


1A1 


(2) SEQ ID NOs: 15 and 17 


2A6 


(3) SEQ ID NO: 19 


2B6 


(4) SEQ ID NOs: 21, 23 and 25 


2C8 


(5) SEQ ID NO: 27 


2C18 


(6) SEQ ID NO: 29 


2C19 


(7) SEQ ID NOs: 31 , 33, 35 and 37 


2D6 



Construction of yeast expression plasmids: p1A2 and p1A2R 

Fig. 6 shows a method of constructing yeast expression plasmids for human P450 1A2. The protein 
40 coding region of P450 1A2 gene of about 1.5 kb excluding about 40 bp at the 5'-terminal was amplified by 
the PCR method using the primers shown in Fig. 1. The resultant fragment of about 1.5 kb was cleaved with 
Sacl and sub-cloned to a pUC118 vector. About 40 bp at the 5'-terminal was chemically synthesized as the 
linkers shown in Fig. 5 and sub-cloned between the Hindlll and Sacl sites of the pUC118 vector. The 
plasmid having the 1.5 kb fragment was digested by Hindlll, blunted, and then ligated with an EcoRI linker. 
45 The EcoRI-SacI fragment was prepared from the resulting plasmid and ligated into the plasmid containing 
the S'-terminal 40 bp. Then, it was treated with EcoRI and blunted. A Hindlll linker was inserted into the 
blunted fragment. The obtained fragment then cleaved with Hind 1 1 J was inserted into pAAHSN and pAHRR 
to construct a yeast expression plasmid p1A2 for human P450 1A2, and a yeast expression plasmid p1A2R 
for simultaneous expression of human P450 1A2 and yeast NADPH-P450 reductase. 

so 

Construction of yeast expression plasmids: p2C9 and p2C9R 

Fig. 7 shows a method of constructing yeast expression plasmids for human P450 2C9. The protein 
coding region of 450 2C9 gene was divided into two fragments of about 0.9 kb and about 0.6 kb, and the' 
55 fragments were amplified by the PCR method using the primers shown in Fig. 1. The resultant fragment of 
about 0.9 kb was cleaved with Pstl and sub-cloned to a pUC B vector, which was prepared by exchanging 
the cloning site located between the two Hind 111 sites, one of which was obtained by converting the EcoRI 
site of pUCl9, with the following cloning sites: 



6 



EP 0 644 267 A2 



EcoRI Spel PstI BamHI Kpnl Hindlll 

Hindlll Xbal SphI Sail Smal SacI 

1 1 1 I I I I I I I 1 1 

5 

The fragment of about 0.6 kb was incorporated between the Xbai and PstI sites of the plasmid having 
the 0.9 kb fragment to ligate the two segments. The Kpnl site of the plasmid was blunted. An Xbal linker 
was inserted to the blunted plasmid. The Xbal fragment containing the coding region was cut out from the 

70 resultant fragment. A modified pUC vector, pUCAN, was constructed by replacing the EcoRI and Hindlll 
sites with Notl sites, followed by insertion of the Notl fragment prepared from pAAHSN between the two 
Notl sites. The Hindlll site of pUCAN vector having the ADH promoter and terminator regions in the pUC 
vector was blunted and inserted into pUCANX introduced with the Xbal linker. The obtained plasmid was 
cleaved with Notl and inserted into pAAHSN and pAHRR treated in a similar manner with Notl to construct a 

;s yeast expression plasmid p2C9 for human P450 2C9, and a yeast expression plasmid p2C9R for 
simultaneous expression of human P450 2C9 and yeast NADPH-P450 reductase. 

Construction of yeast expression plasmids: p2E1 and p2E1R 

20 Fig. 8 shows a method of constructing yeast expression plasmids for human P450 2E1. The protein 
coding region of P450 2E1 gene was divided into two fragments of about 0.5 kb and about 1.0 kb, both of 
which were amplified by the PCR method using the primers shown in Fig. 1. The resultant fragment of 
about 0.5 kb was cleaved with EcoRI and BamHI and sub-cloned to a pUCl 18 vector. Then the fragment of 
about 1.0 kb was incorporated between the BamHI and SphI sites to ligate the two fragments. This was 

25 cleaved with EcoRI, and SphI, and inserted into pUC B first and then cut out with Hindlll. The resultant 
fragment was inserted into pAAHSN and pAHRR vectors to construct a yeast expression plasmid p2E1 for 
human P450 2E1 t and a yeast expression plasmid p2E1R for simultaneous expression of human P450 2E1 
and yeast NADPH-P450 reductase. 

30 Construction of yeast expression plasmids: p3A4 and p3A4R 

Fig. 9 shows a method of constructing yeast expression plasmids for human P450 3A4. The protein 
coding region of P450 3A4 gene was divided into two fragments of about 0.6 kb and about 0.9 kb, both of 
which were amplified by the PCR method using the primers shown in Fig. 2. The resultant fragment of 

35 about 0.6 kb was cleaved with SacI and sub-cloned to a puCH8 vector. Subsequently, it was cleaved with 
EcoRI and blunted. An Xbal linker was ligated to the blunted fragment. The fragment of 0.9 kb was cleaved 
with Xbal and SacI, and incorporated to the resultant fragment above, thus the two fragments were ligated. 
After cleaving the plasmid with Spht, it was blunted. An Xbal linker was ligated to the blunted fragment, 
from which the Xbal segment was cut out and inserted to an Xbal site of pUCANX. This was cut out with 

40 Notl and inserted into pAAHSN and pAHRR treated in a similar manner with Notl. Thus a yeast expression 
plasmid p3A4 for human P450 3A4, and a yeast expression plasmid p3A4R for simultaneous expression of 
human P450 3A4 and yeast NADPH-P450 reductase were constructed. 

Construction of yeast expression plasmids: p1A1 and p1A1R 

45 

Fig. 10 shows a method of constructing yeast expression plasmids for human P450 1A1. The coding 
region for P450 1A1 protein was divided into two fragments of about 1.0 kb and about 0.5 kb and the 
resultant fragments were amplified by the PCR method using the primers shown in Fig. 2. Thus obtained 
fragment of about 1.0 kb was cleaved with Xbal and Sact and sub-cloned to a PUCA vector, which was 
so prepared by exchanging the cloning site located between the two Hindlll sites, one of which was obtained 
by converting the EcoRI site of pUC19, with the following cloning sites: 



Xbal Spel PstI BamHI Kpnl Hindlll 

Hindlll EcoRI SphI Sail- Smal SacI 
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The amplified fragment of about 0.5 kb was sub-cloned into the Hincll site of a pUC 19 vector and the 
resultant plasmid was then cleaved with Sacl. The cleaved fragment was ligated with the plasmid having the 
1.0 kb fragment. After cutting out the coding region from the thus obtained 1A1 gene with Hindlll, the 
fragment was inserted to the Hindlll site of the yeast expression vector pAAHSN having ADH promoter and 

5 terminator regions, and to the same site of vector pAHRR for simultaneous expression of P450 and yeast 
NADPH-P450 reductase of which gene is located upstream of the P450 gene. Thus yeast expression 
plasmid p1A1 for human P450 1A1 and yeast expression plasmid p1A1R for simultaneous expression of 
human P450 1A1 and yeast NADPH-P450 reductase were constructed. 

In addition two kinds of human P450 1A1 gene fragments which were different only in a small portion of 

w the nucleotide sequence were obtained in a similar manner and used to construct two kinds of yeast 
expression plasmid for human P450 1A1, p1A1 Variant 1 and p1A1 Variant 2, and two kinds of plasmids for 
simultaneous expression of human P450 1A1 and yeast NADPH-P450 reductase, p1A1R Variant 1 and 
p1A1R Variant 2. 

15 Construction of yeast expression plasmids: p2A6 and p2A6R 

Fig. 1 1 shows a method of constructing yeast expression plasmids for human P450 2A6. A protein 
coding region of P450 2A6 gene was divided into two fragments of about 0.6 kb and about 0.9 kb, both of 
which were amplified by^the PCR method using the primers shown in Fig. 2 to yield two kinds of human 

20 P450 2A6 gene fragments which were different only in a small portion of the nucleotide sequence. The 
resultant fragment of about 0.6 kb was cleaved with Xbal and Hincll, and sub-cloned to a pUC A vector. 
Then the fragment of 0.9 kb was incorporated between the Hincll and Kpnl sites to ligate the two fragments. 
The obtained fragment was cleaved with Hindlll and inserted into pAAHSN and pAHRR to construct two 
kinds of yeast expression plasmid for human P450 2A6, p2A6 and p2A6 Variant 1 , and two kinds of yeast 

25 expression plasmid for simultaneous expression of human P450 2A6 and yeast NADPH-P450 reductase, 
p2A6R and p2A6R Variant 1. 

Construction of yeast expression plasmids: p2B6 and p2B6R 

30 Fig. 12 shows a method of constructing yeast expression plasmids for human P450 2B6. The entire 
protein coding region of P450 2B6 gene was amplified by the PCR method using the primers shown in Fig. 
3. The resultant fragment was cleaved with Xbal and BamHI and sub-cloned to pUC A. The resulting 
plasmid was partially digested with Hindlll, and inserted into pAAHSN and pAHRR vectors to construct a 
yeast expression plasmid p2B6 for human P450 2B6, and a yeast expression plasrr.id p2B6R for 

35 simultaneous expression of human P450 2B6 and yeast NADPH-P450 reductase. 

Construction of yeast expression plasmids: p2C8 and p2C8R 

Fig. 13 shows a method of constructing yeast expressed plasmids for human P450 2C8. The entire 
40 protein coding region of the P450 2C8 gene was amplified by the PCR method using the primers shown in 
Fig. 3 to yield three kinds of P450 2C8 genes which were different only in a small portion of the nucleotide 
sequence. The resultant fragments were partially digested with Xbal, and sub-cloned to pUC A. The 
fragment was cleaved with Hindlll and inserted into pAAHSN and pAHRR vectors to construct three kinds of 
yeast expression plasmids p2C8 ( p2C8 Variant 1 and p2C8 Variant 2 for human P450 2C8, end three kinds 
45 of yeast expression plasmids, p2C8R, p2C8R Variant 1 and p2C8R Variant 2 for simultaneous expression of 
human P450 2C8 and yeast NADPH-P450 reductase. 

Construction of yeast expression plasmids: p2Cl8 and p2C18R 

so Fig. 14 shows a method of constructing yeast expression plasmids for human P450 2C18. The protein 
coding region of P450 2C18 gene was divided into two segment of about 1.4 kb and about 0.9 kb, then the 
both fragments were amplified by the PCR method using the primers shown in Fig. 3. The amplified 
fragment of about 1.4 kb was cleaved with Pstl and sub-cloned to a pUC A vector. Ths fragment of about 
0.9 kb was incorporated between the Xbal and Pstl sites to ligate the two fragments. After cleaving the 

55 plasmid with Smal. an Xbal linker was introduced. Then an Xbal fragment was prepared and inserted into 
the Xbal site of pUCANX. It was cleaved with Notl and inserted into pAAH5N and pAHRR treated in a 
similar manner with Notl to construct a yeast expression plasmid p2C18 for human P450 2C18, and a yeast 
expression plasmid p2Cl8R for simultaneous expression of human P450 2C18 yeast and NADPH-P450 
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reductase. 

Construction of yeast expression plasmids: p2C19 and p2C19R 

5 Fig. 15 shows a method of constructing yeast expression plasmids for human P450 2C19. Fragments a, 
b and c for the protein coding region of P450 2C19 gene were amplified by the PGR method using the 
primers No. 1, No. 2, No. 3 and No. 4, No.5 and No. 6, and No.5 and No.7 defined by SEQ ID NOs: 39-45, 
respectively. 

Fragments e and f for the protein coding region of human cytochrome P450 2C19 were also amplified 
;o against human cytochrome P450 2C9 gene by the PCR method using the primers No. 8 to 21 having 
nucleotide sequences with some mutations shown by SEQ ID NOs: 46 to 59. A fragment d for the linker 
Nos. 1 and 2 having nucleotide sequences shown by SEQ ID NOs: 60 and 61 was obtained by directly 
synthesizing the DNA to cover the rest of the protein coding region of the human P450 2C19 gene. Thus 
the fragments covering the whole protein coding region of the human cytochrome P450 2C19 were 
75 obtained. 

After the fragments a and b were treated with Xhol and BamHl, and with BamHI and Pstl, both 
fragments were simultaneously inserted between the Xhol and Pstl sites of the Blue Script{ + ). The 
fragment e was treated with Xbal and Xhol and inserted to the Xbal and Xhol sites of the plasmid having 
the fragments a and b'*o give a plasmid having the fragments a, b and e. 

20 After the fragment c was treated with Pstl and Kpnl, the resulting fragment was simultaneously inserted 
with the linker fragment d between the Pstl and EcoRI sites of the Blue Script( + ). The resultant plasmid 
was cut with Pstl and EcoRI to give a fragment containing the fragments c and d. Then this fragment was 
simultaneously inserted between the fragment f treated with EcoRI to the Pstl and Hindi sites of the 
aforementioned plasmid containing the fragment a, b and e. Thus a plasmid having the whole coding region 

25 of the human cytochrome P450 2C19 gene was constructed. The constructed plasmid was cut with Hindlll 
and the resultant fragment was inserted to pAAHSN and pAHRR both of which were treated with Hindlll to 
give a yeast expression plasmid p2C19 for expressing the human P450 2C19 and a yeast expression 
plasmid p2C19R for simultaneous expression of the human P450 2C19 and yeast NADPH-P450 reductase. 

30 SEQ ID NOs and primer Nos. are as follows: 



35 



40 



as 



so 



SEQ 


ID 


No: 39 


Primer No. 


1 


SEQ 


ID 


NO: 


40 


Primer No. 


2 


SEQ 


ID 


NO: 


41 


Primer No. 


5 


SEQ 


ID 


NO: 


42 


Primer No. 


4 


SEQ 


ID 


NO: 


43 


Primer No. 


5 


SEQ 


ID 


NO: 


44 


Primer No. 


6 


SEQ 


ID 


NO: 


45 


Primer No. 


7 


SEQ 


ID 


NO: 


46 


Primer No. 


8 


SEQ 


ID 


NO: 


47 


Primer No. 


9 


SEQ 


ID 


NO: 


48 


Primer No. 


10 


SEQ 


ID 


NO: 


49 


Primer No. 


11 


SEQ 


ID 


NO: 


50 


Primer No. 


12 


SEQ 


ID 


NO: 


51 


Primer No. 


13 


SEQ 


ID 


NO: 


52 


Primer No. 


14 


SEQ 


ID 


NO: 


53 


Primer No. 


15 


SEQ 


ID 


NO: 


54 


Primer No. 


16 


SEQ 


ID 


NO: 


55 


Primer No. 


17 


SEQ 


ID 


NO: 


56 


Primer No. 


18 


SEQ 


ID 


NO: 


57 


Primer No. 


19 


SEQ 


ID 


NO: 


58 


Primer No. 


20 


SEQ 


ID 


NO: 


59 


Primer No. 


21 


SEQ 


ID 


NO: 


60 


Linker No. 


1 


SEQ 


ID 


NO: 


61 


Linker No. 


2 



9 



EP 0 644 267 A2 



Construction of yeast expression plasmids: p2D6 and p2D6R 

Fig. 16 shows a method of constructing yeast expression plasmids for human P450 2D6. The protein 
coding region of 1.3 kb excluding about 200 bp at the 5'-terminal of P450 2D6 gene was divided into two 

5 fragments of about 0.4 kb and about 0.9 kb, and the both fragments were amplified by the PCR method. 
The resultant fragment of aboul0.9 kb was cleaved with Kpnl and sub-cloned to pUC A. For the 200 bp on 
the S'-terminal, three synthetic tinkers shown in Fig. 5 were used and two linkers on the 5'-terminat were 
incorporated into Xbal and Pstl sites of a Blue Script( + ) vector and then other linkers were incorporated 
into Smal and Pstl sites. Then fragment of about 0.4 kb obtained by the PCR method was incorporated into 

70 the Pstl and Hindi sites of the ptasmid and then cleaved with NspV and Xbal. The resultant fragment was 
inserted into the plasmid containing the 0.9 kb fragment to ligate the coding region. This was cleaved with 
Hindlll and inserted into pAAHSN and pAHRR vectors to construct a yeast expression plasmid p2D6 for 
human P450 2D6, and a yeast expression plasmid p2D6R for simultaneous expression of human P450 2D6 
and yeast NADPH-P450 reductase. 

75 Then three kinds of human P450 2D6 gene fragments which were different only in a small portion of the 
nucleotide sequence were obtained in a similar manner as described above and used to construct two kinds 
of yeast expression plasmids for human P450 2D6, p2D6 Variant 1, p2D6 Variant 2 and p2D6 Variant 3, and 
three kinds of yeast expression plasmid 2D6R for simultaneous expression of human P450 2D6 yeast and 
NADPH-P450 reductase/p2D6R Variant 1, p2D6R Variant 2 and p2D6R Variant 3. 

20 

Construction of yeast expression plasmid containing artificial fused enzyme gene 

An expression plasmid was constructed in accordance with Fig. 17. The Xbal-Xhol fragment was 
amplified with plasmid p3A4 by using the primers shown in Fig. 4. On the other hand, the Xhol-Hindlll 

25 fragment of about 2.1 kb was obtained from the plasmid pBFCRI (Japanese Patent Application No. 4- 
209226) and inserted between the Xhol and Hindlll sites of a commercial vector Blue Script( + ), followed by 
digestion with restriction enzymes Xhol and Xbal. These two fragments were simultaneously inserted to the 
Xbal site of the vector pUCAN, which was then digested with Notl to give a fragment of about 5.6 kb. The 
desired yeast expression plasmid pF3A4 was obtained by ligating the fragment with the Notl fragment of 

30 about 10.5 kb obtained from vector pAAH5N (Japanese Patent Laid-open Publication No. 2-211880). The 
artificial fused enzyme consists of 1156 amino acid residues of which sequence structure comprising, 
successively, from the N-terminal end, an entire amino acid sequence (503 residues) of human liver 
cytochrome P450 3A4, a linker-derived sequence (Ala-Arg-Ala), and a sequence of from the 42nd residue to 
C-terminal of yeast NADPH-cytochrome P450 reductase. 

35 

Preparation of transformed yeast cell 

Saccharomyces cerevisiae AH 22 was inoculated to 1.0 ml of YPD culture medium (1% yeast extract, 
2% poly peptone, 2% glucose). After shaken at 30 *C for 18 hours, the yeast cells were collected by 

40 centrifugation (5000 x g, 10 min). The resultant cells were suspended in 10 ml of 0.2 M LiCI solution and 
then centrifuged again (5000 x g, 10 min) to obtain pellets. Then 20 ul of 1 M LiCI solution, 30 ul of 70% 
polyethylene glycol 4000 and each 10 u1 solution containing about 1.0 ug of various kinds of yeast 
expression plasmids for the human P450 molecular species and yeast NADPH-reductase constructed as 
above were added to the resultant pellets. After sufficiently mixing them, they were incubated at 30'C for 

45 one hour and further stirred after the addition of 140 ul of sterilized water. The solution was plated on SD 
synthetic culture medium (2.0% glucose, 0.67% nitrogen base w/o amino acids, manufactured by Difco Co., 
20 ug/ml of histidine, 2.0% agar) and incubated at 30 °c for three days. Then transformed yeast cells 
possessing the yeast expression plasmid described above were selected. In this way. various kinds of yeast 
cells expressing the human P450 molecular species were prepared. 

50 

Quantitative measurement of human P450 expressed in yeast 

Each 200 ml of culture broth of each kind of yeast cells expressing human P450 molecular species and 
yeast NADPH-reductase simultaneously or expressing an artificial fused enzyme comprising human P45t) 
55 molecular species and yeast NADPH-reductase prepared as above (SD synthetic culture medium, cell 
concentration: about 1.5 x 10 7 cells/ml) was used to collect the cells. The collected cells were then 
suspended in 10 ml of 100 mM potassium phosphate buffer solution (pH 7.0) and centrifuged (5000 x g, 10 
min) to obtain pellets. Thus obtained pellets were resuspended in 2.0 ml of 100 mM potassium phosphate 
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buffer solution (pH 7.0) and 1 ml of each of the solutions were poured into two cuvettes. After bubbling 
carbon monoxide to a sample cuvette, 5 to 10 mg of dithionrte was added to both of the cuvettes, and 
stirred and then difference spectrum at 400-500 nm was measured to calculate the concentration of P450 
present in the yeast. The amount of each kind of human P450 species or an artificial fused enzyme in each 
5 kind of transformed yeast cells was at a level from about 10 s to about 10 s molecules/cell. 

Preparation of yeast S-9 Mix fraction, cytoplasmic fraction and microsomal fraction 

First, 3.8 liter of each kind of culture broth (SD synthetic culture medium, cell concentration: about 1.0 x 
70 10 3 cells/ml) of yeast cells expressing human P450 molecular species and yeast NADPH-reductase 
simultaneously or an artificial fused enzyme comprising human P450 molecular species and yeast NADPH- 
reductase prepared as above was collected and the resultant cells were suspended in 400 ml of a buffer 
solution A (10 mM Tris-HCI (pH 7.5), 2 M sorbitol. 0.1 mM DTT, 0.2 mM EDTA). to which 160 mg of 
Zymolyase 100,000 (Zymolyase 100T) was added, and the obtained solution was incubated at 30 *C for 60 
15 min. Spheroplast obtained by centrifugation (5000 x g, 10 min) was suspended in 100 ml of the buffer 
solution A and then centrifuged (5000 x g, 10 min). Washing the spheroplast by repeating the same 
centrifugal operation once again, the spheroplast was finally suspended in 200 ml of a buffer solution (10 
mM Tris-HCI (pH 7.5), 0.65 M sorbitol. 0.1 mM DTT), which was then subjected to ultrasonic pulverization 
(50 W, for 5 min). The cell free extracts were centrifuged (9000 x g, 20 min) and supernatants were 
20 recovered to obtain a yeast S-9 Mix fraction. Further, the fraction was centrifuged (125,000 x g, 70 min) to 
collect precipitates which were suspended again into 10 ml of 0.1 M potassium phosphate buffer solution 
(pH 7.4) to obtain a microsomal fraction. On the other hand, a cytoplasmic fraction was obtained by 
recovering the supernatants. 

25 Construction of yeast expression plasmid using GAPDH promoter and its expression in yeast 

Fig. 18 shows a method of constructing a yeast expression plasmid using a GAPDH promoter. A Hindlll 
fragment (about 3.0 kb) obtained from pARRN (described in the Japanese Patent Laid-open Publication No. 
2-211880) was inserted into a Hindlll site of plasmid pUN, which was obtained by cleaving pUC19 with 

30 EcolRI, blunt-ending and ligation with an Notl linker to give pUR. On the other hand, after blunting an Xhol 
site of plasmid pAAHS and inserting an Xbal linker, it was cleaved with restriction enzymes Xbal and Sail 
and the resultant fragment (about 2.2 kb) was inserted to Xbal and Sail sites of pUC19. The three 
fragments, namely, a fragment (about 2.2 kb) obtained by cleaving the resultant plasmid with Xbal and Pstl, 
the Xbal-Pstl fragment (about 1,3 kb) cut out from 2 urn DMA of Saccharomyces cerevisiae AH22, and a 

35 fragment obtained by cleaving pUR with Pstl were iigated to give a plasmid pURL. Further, the pURL was 
cleaved with Hindlll, blunted and Iigated to remove the Hindlll site. Then, an Notl fragment (about 1.6 kb) 
containing GAPDH promoter and terminator (obtained by the method as described in Agric. Biol. Chem., 51, 
1641-1647 (1987) and J. Biol, Chem., 267, 16497-16502 (1992)) was Iigated to the Notl site of pURL to give 
a plasmid pURLG. Human P450 2D6 cDNA obtained by the method used for the construction of p2D6 was 

40 inserted to a Hindlll site of pURLG to obtain a yeast expression plasmid pG2D6R for simultaneous 
expression of human P450 2D6 and yeast NADPH-P450 reductase. When the plasmid was introduced by 
the method used in the preparation of transformed yeast cells as above to Saccharomyces cerevisiae AH22, 
production of human P450 2D6 was observed. 

45 Metabolism of 7-ethoxycoumarin using transformed yeast cells 

7-Ethoxycoumarin was added to each 2 ml of the culture media of the transformed yeast cells 
expressing (i) human cytochrome P450 molecular species and yeast NADPH-P450 reductase; or (ii) an 
artificial fused enzyme comprising human cytochrome P450 molecular species and yeast NADPH-P450 

so reductase (SD synthetic culture medium, cell concentration: about 2.0 x 10 7 cells/ml) so that the final 
concentration of 7-ethoxycoumarin was 0.5 mM. After incubation at 30 * C for 2 or 5 hours, supernatants 
were obtained by centrifugation (5000 x g, 10 min). To the supernatants 62.5 u\ of 15% TCA (trichloroacetic 
acid) and 2 ml of chloroform were added and, after well stirring, a chloroform layer was recovered by 
centrifugation (5000 x g, 10 min), to which 4 ml of 0.01 N NaOH containing 0.1 M NaCI was added 'and 

55 stirred sufficiently and then centrifuged (5000 x g, 10 min). After recovering the supernatants. fluorescence 
.was measured for the supernatant fraction (ex. 366 nm, em 452 nm) to quantitatively measure the reaction 
product 7-hydroxycoumarin. As a result, O-deethylation activity for 7-ethoxycoumarin can be observed for 
all of 11 kinds of the yeast cells expressing the human P450 molecular species. P450 1A1 and P450 2B6 
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showed strong activity; and P450 1A2, P450 2E1, P450 2A6 and P450 2D6 showed good activity, while 
P450 2C8, P450 2C9, P450 3A4, P450 2C18 and P450 2C19 showed moderate activity. 

Metabolism of tolbutamide using transformed yeast cells 

5 

In the same manner as above, tolbutamide was added to each of the culture solutions of the 
transformed yeast cells expressing (i) human cytochrome P450 molecular species and yeast NADPH-P450 
reductase; or (ii) an artificial fused enzyme comprising human cytochrome P450 molecular species and 
yeast NADPH-P450 reductase so that the concentration of the compound was 1.0 mM. After incubation at 

w 30 'C for 15 hours, the culture supernatant was then obtained by centrifugation (5000 x g, 10 min). To the 
supernatant, 2 ml of dichloromethane was added. After sufficient stirring, the dichloromethane layer was 
recovered by centrifugation (5000 x g, 10 min). and the solvent was evaporated under reduced pressure. 
The resultant residue was dissolved in 100 ul of acetonitrile, and the solution was analyzed by HPLC under 
the following conditions. As a result, hydroxylated tolbutamide was detected in the solution of yeast cells 

is expressing human P450 2C8, P450 2C9, P450 2C18 and P450 2C19. The human P450 2C9 showed high 
activity and 2C19 showed good activity. On the other hand, hydroxylated tolbutamide was not detected in 
the solution of yeast cells expressing other human P450 than described above. 
Conditions for HPLC 

Column: liBondapak C18 (manufactured by Waters Co.) 

20 Carrier: 10-70% acetonitrile-distilled water (linear concentration gradient for 20 min) 

Temperature: 50 *C 

Detection: UV 230 nm 

Injection amount: 50 ul 

25 Metabolism of testosterone using transformed yeast cells 

In the same manner as above, testosterone was added to each of the culture solutions of the 
transformed yeast cells expressing (i) human cytochrome P450 molecular species and yeast NADPH- ; P450 
reductase; or (ii) an artificial fused enzyme comprising human cytochrome P450 molecular species and 

30 yeast NADPH-P450 reductase so that the concentration of the compound was 0.05 mM. After incubation at 
30 *C for 15 hours, the supernatant was obtained by centrifugation (5000 x g, 10 min). Then 2 ml of 
dichloromethane was added. After sufficient stirring, the solution was centrifuged again (5000 x g, 10 min). 
The dichloromethane layer was recovered from the separated layer and the solvent was evaporated under 
reduced pressure. The resultant residue was dissolved in 100 ul of acetonitrile, and the solution was 

35 analyzed by HPLC under the following conditions. As a result, hydroxylated testosterone was detected for 
yeast cells expressing human P450 1A1, P450 2C8 and P450 3A4. On the other hand, hydroxylate 
testosterone was not detected for yeast ceils expressing other human P450 than described above. 
Conditions for HPLC 

Column: uBondapak C18 (manufactured by Waters Co.) 

40 Carrier: 20-70% acetonitrile-distilled wc 1 .?: ( ,: ?e?.r c?~c?^r2tion g-?d : ?^ ^ ~'"] 

Temperature: 50 "C 

Detection: UV 254 nm 

Injection amount: 50 ul 

45 Metabolism of chlorzoxazone using transformed yeast cells and microsomal fractions thereof 

Chlorzoxazone was added to each of the culture solutions of the transformed yeast cells expressing (i) 
human cytochrome P450 molecular species and yeast NADPH-P450 reductase; or (ii) an artificial fused 
enzyme comprising human cytochrome P450 molecular species and yeast NADPH-P450 reductase as 
above so that the concentration of the compound was 0.5 mM. After incubation at 30 'C for 15 hours, the 
supernatant was obtained by centrifugation (5000 x g, 10 min). Then 2 ml of dichloromethane was added to 
the supernatant and vigorously stirred and centrifuged (5000 x g, 10 min). The dichloromethane layer was 
recovered from the separated layer, then evaporated under reduced pressure. The obtained residue was 
dissolved in 100 ul of acetonitrile. and the solution was analyzed by HPLC under the following conditions. 

NADPH and chlorzoxazone were added to a microsomal fraction of yeasts expressing (i) human 
cytochrome P450 molecular species and yeast NADPH-P450 reductase; or (ii) an artificial fused enzyme 
comprising human cytochrome P450 molecular species and yeast NADPH-P450 reductase prepared as 
above so that the concentrations of NADPDH and chlorzoxazone were 0.5 mM and 250 uM. Then the 



50 



55 



-4 2 



EP 0 644 267 A2 



10 



solutions were incubated at 37 *C for 10 min. After that, trichloroacetic acid was added to the solutions so 
that the concentration of the trichloroacetic acid was about 10% (v/v). Then 2 ml of dichloromethane was 
added to the solution, and the solution was stirred vigorously and centrifuged (15,000 x g, 5 min). The 
dichloromethane layer was recovered, and the solvent was removed under reduced pressure. The obtained 
residue was dissolved in 100 ul of acetonitrile and the solution was subjected to analysis by HPLC under 
the same conditions as above,. 

All of the yeast cells expressing eleven human P450 molecular species gave hydroxylated chlorzox- 
azone. P450 2E1 showed high activity, and P450 1A1. P450 1A2. P450 2A6. P450 2D6 showed good 
activity, while P450 2C8, 2C9. 2B6, 2C18, 2C19 and 3A4 showed moderate activity. 

Ames test using yeast S-9 Mix fraction and microsomal fraction 



The Ames test method was in accordance with the customary method described, for example, in Mutat. 
Res., (1975) 31, 347. 2-Aminoanthrathene which is an arylamine type compound was used as a specimen 

is compound. (1) Rat S-9 Mix supernatant fraction (obtained by homogenizing liver and then subjected to 
centrifugation (9000 x g, 10 min), manufactured by Kikkoman) containing each kind of rat P450 molecular 
species at the concentration of 1200 pmol per 1 sample and (2) Yeast S-9 Mix fraction obtained from each 
kind of yeast cells expressing human P450 or a microsomal fraction prepared from the yeast S-9 Mix 
fraction were used as ; a metabolic activation source in the Ames test. As a result, more than 1000 revertant 

20 colonies were detected for the compound at 1 ug/plate (90 mm dia.) only in the case of using the yeast S-9 
Mix fraction obtained from the yeast cells expressing human P450 1A2 (Saccharomyces cerevisiae 
AH22/p1A2R) and yeast cells expressing human P450 2E1 (Saccharomyces cerevisiae AH22/p2E1R) and a 
microsomal fraction prepared from the yeast S-9 Mix fraction, while the amounts of the human P450 
molecules of these fractions were only one five hundredth and one thirtieth of the human P450 molecules 

25 present in the Rat S-9 mixture. 

The human cytochrome P450 1A2 showed high activity, and human P450 2E1 showed only moderate 
activity. But the revertant colonies were not found for the human cytochrome P450 3A4, 2C8 and 2A6. 
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Metabolism of acetanilide using transformed yeast cells 



Acetanilide was added to each of the culture solutions of the transformed yeast cells expressing (i) 
human cytochrome P450 molecular species and yeast NADPH-P450 reductase; or (ii) an artificial fused 
enzyme comprising human cytochrome P450 molecular species and yeast NADPH-P450 reductase, so that 
the concentration of the compound was 5 mM, and the solution was incubated at 30 *C for 15 hours. Then 
35 the solution was centrifuged (5000 x g, 10 min) to give a supernatant. The obtained supernatant solution 
was subjected to the HPLC analysis under the following conditions. The hydroxylated acetanilide was found 
for all of the tested eleven human P450 molecular species. 

Among them, P450 1A2 and 2D6 showed high activity and P450 1A1, 2A6, 2B6, 2C8, 2C9, 2C18, 2C19 
and 2E1 showed good activity, while 3A4 showed moderate activity. 
40 Conditions for HPLC 

Column: uBondapak C18 (manufactured by Waters Co.) 

Carrier: Methanokwatenacetic acid = 15:84:1 

Temperature: 30 0 C 

Detection: UV 254 nm 

45 Injection amount: 50 ul 



Metabolism of coumarin using transformed yeast cells 



Coumarin was added to 6 ml of each of the culture solutions (SDS synthetic culture medium, cell 
so concentration of about 2.0 x 10 7 cells/ml) of the transformed yeast ceils expressing (i) human cytochrome 
P450 molecular species and yeast NADPH-P450 reductase; or (ii) an artificial fused enzyme comprising 
human cytochrome P450 molecular species and yeast NADPH-P450 reductase prepared as above, so that 
the concentration of the compound was 5 mM, and the solution was incubated at 30 'C for 2 or 5 hours. 
Then the solution was centrifuged (5000 x g, 10 min) to give a supernatant. 62.5 u.l of 15% trichloroacetic 
55 acid and 2 ml of chloroform were added to the obtained supernatant solution, and the resultant solution was 
stirred well. The chloroform layer was recovered from the separated layer. Then 4 ml of sodium hydroxide 
solution containing 0.1 M NaCI was added to the solution and centrifuged again (5000 x g, 10 min). The 
supernatant fraction was recovered and subjected to fluorescence analysts (ex. 366 nm, em. 452 nm) to 
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measure the 7-hydroxycoumarin formed. The hydroxylation activity was specifically found only for the yeast 
cells expressing the human P450 2A6. while other yeast cells showed no activity. 



Metabolism of debrisoquine using the microsomal fraction of transformed yeast whole cells 

NADPDH and [ ,4 C]debrisdquine were added to each microsomal fraction solution of (i) human 
cytochrome P450 molecular species and yeast NADPH-P450 reductase; or (ii) an artificial fused enzyme 
comprising human cytochrome P450 molecular species and yeast NADPH-P450 reductase prepared as 
above, so that the concentration of the compound was 100 U.M and that of NADPH is 6 mM, and the 
solution was incubated at 30 'C for 30 minutes. Then perchlorate was added to the solution, so that the final 
concentration of the perchlorate was 10% (v/v). The solution was sufficiently stirred and centrifuged (15,000 
x g, 15 min) to give the supernatant. The obtained supernatant was subjected to HPLC analysis according 
to the following conditions. 

Microsomal fractions of yeasts expressing P450 1A1 and 2D6 showed good activity for the hydrox- 
ylation of the debrisoquine, while those of yeast cells expressing other human P450 molecular species 
showed no activity. 

Conditions for HPLC 



Column: 
Carrier: 



COSMOSIL 5C18 (manufactured by Nakarai Tesq Co.) 
A(acetonitrile)/B(20mM Sodium Perchlorate, pH = 2.5) 



Time (minute) 


A/B 


0-15 


9/91 


15-30 


9/91-25/75 (linear gradient) 


30-32 


100/0 


32-42 


9/91 



Temperature: 
Detector: 
Injection amount: 



room temperature 
Rl 14 C 
100 ul 



Metabolism of S-rnephenytoin using the microsomal fraction of transformed yeast cells 

NADPH and [ u C]S-mephenytoin were added to each microsomal fraction solution of (i) human 
cytochrome P450 molecular species and yeast NADPH-P450 reductase; or (ii) an artificial fused enzyme 
comprising human cytochrome P450 molecular species and yeast NADPH-P450 reductase prepared as 
above, so that the concentration of the compound was 25 uM and that of NADPH was 3 mM, and the 
solution was incubated at 30 # C for 30 minutes. Then the solution was diluted with equal volume of 
methanol, sufficiently stirred and centrifuged (15,000 x g, 5 min) to give the supernatant. The obtained 
supernatant was subjected to HPLC analysis according to the following conditions. 

Microsomal fractions of yeasts expressing P450 2C19 showed good activity for the hydroxylation of the 
S-mephenytoin, while those of yeast cells expressing other human P450 molecular species showed no 
activity. 

Conditions for HPLC 

Column: COSMOSIL 5C18 (manufactured by Nakarai Tesq Co.) 

Carrier: A:(MethanoI)'(20 mM Potassium phosphate buffer, pH = 7.0) = 40/60 

B:Methanol 



Time (minute) 


A/B 


0-18 

18-20 
20-35 


100/0 
0/1OQ - 
100/0 



Temperature: 
Detector: 

Specimen amount: 



room temperature 
Rl 14 C 
100 a\ 
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55 Metabolism of chlorzoxazone using a mixture of microsomal fractions of transformed yeast cells 

Microsomal fractions of yeast expressing cytochrome P450 prepared as above were mixed in the 
following motar ratios, and the hydroxylation activities of the mixed solutions were measured using 
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chlorzoxazone. 



5 



JO 



75 



P450 


System A 


System B 


3A4 


35% 


33% 


2C9 


25% 


5.8% 






5 ft% 


2C18 




5.8% 


2C19 




5.8% 


1A2 


23% 


19% 


2E1 


17% 


15% 


1A1 




2.4% 


2A6 




3.0% 


2B6 




2.4% 


2D6 




2.4% 



The substrate, [ u ^chlorzoxazone and NADPH were added to the mixed yeast microsomal fractions, so 
that the concentrations pf the compound and NADPH were 382 uM and 3 mM. The solutions were 
incubated at 37 " C for 30 min, and then 1 ml of dichloromethane was added thereto to stop the reaction. 
After stirring, dichloromethane layer was recovered by centrifugation (10,000 x g, 5 min). Then the solvent 
was evaporated by the stream of nitrogen gas. The obtained residue was dissolved in 54 ul of acetonitrile 
and 146 ul of water, the solution was subjected to HPLC analysis under the following conditions. 

Conditions for HPLC 

Column: COSMOSIL 5C18 (manufactured by Nakarai Tesq Co.) 

Carrier: A(Acetonitrile/Water = 27/73) 

B(Acetonitrile) 



Time (minute) 


( A/B 


0-15 
15-17 
17-25 


100/0 
0/100 
100/0 



35 Temperature: room temperature 

Detector: Rl U C 

Injection amount: 100 ul 

The metabolites of chlorzoxazone observed by each of the mixed systems A and B were simitar to 
those metabolites which Guengerich reported based on their experimental results by using human liver 
40 microsomal fractions (Guengerich, F.P., Chem. Toxicil., Vol.3, pp.566-573, 1990). 

Furthermore, the metabolic turnover numbers were calculated for the human liver microsomal fraction 
(by Guengerich) and for the present yeast microsomal fractions. 

The turnover numbers were calculated to be 1.8 and 1.6 in the mixed systems A and B, respectively. 
The turnover V for the human liver microsomal fraction was calculated using V max , K m and substrate 
45 concentration [S] described in the literature according to the following manner. The results are shown in 
Table 2. The values somewhat varied due to the difference of individuals, the lowest value being 1.0 and 
the highest value being 5.9. The values of V for the mixed system B and A fell within this range, both of 
which were the same level. It was confirmed that the four kinds of molecular species in system A can well 
reproduce the metabolic system in human liver in vitro, 
so A turnover V for human cytochrome P450 at an optional substrate concentration can be calculated by 
substituting V mox and K m described in the literature and substrate concentration [S] of the present example 
into the Michaelis-Menten's equation: 

V = (V max '[S])/(K m + [S]) 

55 
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Table 2 



Liver sample 


Metabolic turnover V [product nmol/nmol 
P450/min] 


#1001 


5.9 


KDL 14 


2.2 


KOL 21 


1.7 


KDL 23 


3.0 


KDL 27 


5.0 


h in 


I . I 


H 11 


1.0 


H 12 


4.2 


H 13 


3.3 


H 14 


2.1 


H 15 


4.3 


H 16 


4.0 


H 17 


3.6 


S H 18 


3.4 


Designations of the human liver sample were those used by Guengerich. 



Metabolism of debrisoquine using mixture of microsomal fractions of transformed yeast cells 

Microsomal fractions of yeasts expressing human cytochrome P450 were mixed, and the hydroxylation 
activity of the mixed fraction was measured using debrisoquine. The mixing molar ratio, of the human 
cytochrome P450 molecular species were as follows: 



30 



P450 


Molar ratio 


3A4 


33% 


2C9 


5.8% 


2C8 


5.8% 


2C18 


5.8% 


2C19 


5.8% 


1A2 


19% 


2E1 


15% 


1A1 


2.4% 


2B6 


2.4% 


2D6 


2.4% 



The substrate debrisoquine and NADPH were added to the mixed microsomal fraction solutions, so that 
the concentrations were 100 uM for the NADPH and 6 mM for the compound. After the mixture was 
incubated at 37* C for 30 min p 50 ul of 60% perchlorate was added to the solution to stop the reaction. The 
concentration of the perchlorate was finally 12.5% (v/v). After vigorous stirring, the mixture was centrifuged 
{15,000 x g, 5 min) to recover the supernatant, which was subjected to HPLC analysis under the same 
conditions used for analyzing the metabolites of debrisoquine. 

The metabolites well coincided with the metabolites which Kronbach reported based on the experiments 
to metabolize the debrisoquine using the human liver microsome (Methods in Enzymology, Vol.206, pp. 509- 
517, 1991). 

Metabolism of S-mephenytoin using mixture of microsomal fractions of transformed yeast cells ( . 

Microsomal fractions of yeasts expressing various human cytochrome P450 prepared were mixed, and 
the hydroxylation activity of the mixed fraction was measured for S-mephenytoin. The mixing ratio of the 
human cytochrome P450 molecular species was the same as that of the mixing system B as described 
above. 



45 



50 
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The substrate, [ 14 C]S-mephenytoin and NADPH were added to the mixed microsomal fraction solutions, 
so that the concentrations were 28 uM for the NADPH and 6 mM for the compound. After the mixture was 
incubated at 37 *C for 30 min, 250 ul of methanol was added to the solution to stop the reaction. After 
vigorous stirring, the mixture was centrifuged (15,000 x g, 5 min) to recover the supernatant, which was 
subjected to HPLC analysis under the same conditions used for the hydroxylation of S-mephenytoin using 
microsomal fraction. The metabolites obtained well coincided with the metabolites which Goldstein reported 
based on the experiments to metabolize the S-mephenytoin using the human liver microsome (Biochem- 
istry, Vol.33, pp.1 743-1 752, 1994). 



10 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



,5 (i) APPLICANT: 

(A) NAME: Sumitomo Chemical Company, Limited 

(B) STREET: 5-33, Kitahama 4-chome, Chuo-ku, 

(C) CITY: Osaka-shi, Osaka-fu 

(E) COUNTRY: Japan 

(F) POfeTAL CODE (ZIP) : none 



20 



25 



30 



35 



(ii) TITLE OF INVENTION: METHOD FOR SAFETY EVALUATION OF CHEMICAL 
COMPOUND USING RECOMBINANT YEAST EXPRESSING HUMAN 
CYTOCHROME P4 50 

(iii) NUMBER OF SEQUENCES: 61 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(3) COMPUTER: IBM PC compatible •;. 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUT-IB ER : JP 201120/1993 
(3) FILING DATE: 20-JUL-1993 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUM3ER : JP 180246/1993 
(3) FILING DATE: 21-JUL-1993 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 208279/1993 

(B) FILING DATE: 30-JUL-1993 



40 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1. .1548 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG GCA TTG TCC CAG TCT GTT CCC TTC TCG GCC ACA GAG CTC CTC CTG 4 8 

Met Ala Leu Ser Gin Ser Val Pro Phe Ser Ala Thr Glu Leu Leu Leu 
15 10 15 
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GCC TCT GCC ATC TTC TGC CTG GTA TTC TGG GTG CTC AAG GGT TTG AGG 96 

Ala Ser Ala He Phe Cys Leu Val Phe Trp Val Leu Lys Gly Leu Arg 
20 25 30 

5 CCT CGG GTC CCC AAA GGC CTG AAA AGT CCA CCA GAG CCA TGG GGC TGG 14 4 

Pro Arg Val Pro Lys\Gly Leu Lys Ser Pro Pro Glu Pro Trp Gly Trp 
35 40 45 

CCC TTG CTC GGG CAT GTG CTG ACC CTG GGG AAG AAC CCG CAC CTG GCA 192 

Pro Leu Leu Gly His Val Leu Thr Leu Gly Lys Asn Pro His Leu Ala 
70 50 55 60 

CTG TCA AGG ATG AGC CAG CGC TAC GGG GAC GTC CTG CAG ATC CGC ATT 24 0 

Leu Ser Arg Met Ser Gin Arg Tyr Gly Asp Val Leu Gin Xle Arg He 
65 70 75 80 



;5 



20 



25 



30 



GGC TCC ACG CCC GTG CTG GTG CTG AGC CGC CTG GAC ACC ATC CGG CAG 288 
Gly Ser Thr Pro Val Leu Val Leu Ser Arg Leu Asp Thr He Arg Gin 
85 90 95 

GCC CTG GTG CGG CAG GGC GAC GAT TTC AAG GGC CGG CCT GAC CTC TAC 33 6 

Ala Leu Val Arg^Gln Gly Asp Asp Phe Lys Gly Arg Pro Asp Leu Tyr 
100 105 110 

ACC TCC ACC CTC ATC ACT GAT GGC CAG AGC TTG ACC TTC AGC ACA GAC 384 
Thr Ser Thr Leu He Thr Asp Gly Gin Ser Leu Thr Phe Ser Thr Asp 
115 120 125 

TCT GGA CCG GTG TGG GCT GCC CGC CGG CGC CTG GCC CAG AAT GCC CTC 43 2 

Ser Gly Pro Val Trp Ala Ala Arg Arg Arg Leu Ala Gin Asn Ala Leu 
130 135 140 

AAC ACC TTC TCC ATC GCC TCT GAC CCA GCT TCC TCA TCC TCC TGC TAC ' 48 0 

Asn Thr Phe Ser He Ala Ser Asp Pro Ala, Ser Ser Ser Ser Cys Tyr 
145 150 155 160 

CTG GAG GAG CAT GTG AGC AAG GAG GCT AAG GCC CTG ATC AGC AGG TTG 52 8 

Leu Glu Glu His Val Ser Lys Glu Ala Lys Ala Leu He Ser Arg Leu 
165 170 175 

CAG GAG CTG ATG GCA GGG CCT GGG CAC TTC GAC CCT TAC AAT CAG GTG 576 
Gin Glu Leu Met Ala Gly Pro Gly His Phe Asp Pro Tyr Asn Gin Val 
180 185 190 

GTG GTG TCA GTG GCC AAC GTC ATT GGT GCC ATG TGC TTC GGA CAG CAC 624 

Val Val Ser Val Ala Asn Val lie Gly Ala Met Cys Phe Gly Gin Kis 
195 200 205 

AO 

TTC CCT GAG AGT AGC GAT GAG ATG CTC AGC CTC GTG AAG AAC ACT CAT 6 72 

Phe Pro Glu Ser Ser Asp Glu Met Leu Ser Leu Val Lys Asn Thr His 
210 " 215 220 

GAG TTC GTG GAG ACT GCC TCC TCC GGG AAC CCC CTG GAC TTC TTC CCC 72 0 

45 Glu Phe Val Glu Thr Ala Ser Ser Gly Asn Pro Leu Asp Phe Phe Pro 
225 230 235 240 

ATC CTT CGC TAC CTG CCT AAC CCT GCC CTG CAG AGG TTC AAG GCC TTC 768 
He Leu Arg Tyr Leu Pro Asn Pro Ala Leu Gin Arg Phe Lys Ala Phe 
245 250 255 



35 
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AAC CAG AGG TTC CTG TGG TTC CTG CAG AAA ACA GTC CAG GAG CAC TAT 816 
Asn Gin Arg Phe Leu Tro Phe Leu Gin Lys Thr Val Gin Glu His Tyr 
260 265 270 

CAG GAC TTT GAC AAG AAC AGT GTC CGG GAC ATC ACG GGT GCC CTG TTC 864 
Gin Asp Phe Asp Lys Asn Ser Val Arg Asp He Thr Gly Ala Leu Phe 
275 N 280 285 

AAG CAC AGC AAG AAG GGG CCT AGA GCC AGC GGC AAC CTC ATC CCA CAG 912 
Lys His Ser Lys Lys Gly Pro Arg Ala Ser Gly Asn Leu He Pro Gin 
290 295 300 

GAG AAG ATT GTC AAC CTT GTC AAT GAC ATC TTT GGA GCA GGA TTT GAC 960 
Glu Lys lie Val Asn Leu Val Asn Asp He Phe Gly Ala Gly Phe Asp 
305 310 315 320 

ACA GTC ACC ACA GCC ATC TCC TGG AGC CTC ATG TAC CTT GTG ACC AAG 1008 
75 Thr Val Thr Thr Ala He Ser Trp Ser Leu Met Tyr Leu Val Thr Lys 

325 330 335 

CCT GAG ATA CAG AGG AAG ATC CAG AAG GAG CTG GAC ACT GTG ATT GGC • 1056 

Pro Glu He Gin &rg Lys He Gin Lys Glu Leu Asp Thr Val He Gly 
340 345 350 

20 

AGG GAG CGG CGG CCC CGG CTC TCT GAC AGA CCC CAG CTG CCC TAC TTG 1104 
Arg Glu Arg Arg Pro Arg Leu Ser Asp Arg Pro Gin Leu Pro Tyr Leu 
355 350 365 

GAG GCC TTC ATC CTG GAG ACC TTC CGA CAC TCC TCC TTC TTG CCC TTC 1152 
25 Glu Ala Phe He Leu Glu Thr Phe Arg His Ser Ser Phe Leu Pro Phe 
370 375 380 

ACC ATC CCC CAC AGC ACA ACA AGG GAC ACA ACG CTG AAT GGC TTC TAC '*" 1200 

Thr He Pro His Ser Thr Thr Arg Asp Thr Thr Leu Asn Gly Phe Tyr 

385 390 l 395 400 

30 

ATC CCC AAG AAA TGC TGT GTC TTC GTA AAC CAG TGG CAG GTC AAC CAT 124 8 

lie Pro Lys Lys Cys Cys Val Phe Val Asn Gin Trp Gin Val Asn His 

405 410 415 

GAC CCA GAG CTG TGG GAG GAC CCC TCT GAG TTC CGG CCT GAG CGG TTC 12 96 

35 Asp Pro Glu Leu Trp Glu Asp Pro Ser Glu Phe Arg Pro Glu Arg Phe 
420 425 430 

CTC ACC GCC GAT GGC ACT GCC ATT AAC AAG CCC TTG AGT GAG AAG ATG 134 4 

Leu Thr Ala Asp Gly Thr Ala He Asn Lys Pro Leu Ser Glu Lys Met 
435 440 445 



40 



45 



50 



ATG CTG TTT GGC ATG GGT AAG CGC CGG TGT ATC GGG GAA GTC CTG GCC 13 92 

Met Leu Phe Glv Met Gly Lys Arg Arg Cys He Gly Glu Val Leu Ala 
450 " 455 460 

AAG TGG GAG ATC TTC CTC TTC CTG GCC ATC CTG CTA CAG CAA CTG GAG 14 4 0 

Lys Trp Glu He Phe Leu Phe Leu Ala He Leu Leu Gin Gin Leu Glu 

465 470 475 480 
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20 



25 



30 



35 



40 



45 



50 



TTC AGC GTG CCG CCG GGC GTG AAA GTC GAC CTG ACC CCC ATC TAC GGG 14 88 

Phe ser Val Pro Pro Gly Val Lys Val Asp Leu Thr Pro lie Tyr Gly 
485 490 495 

CTG ACC ATG AAG CAC GCC CGC TGT GAA CAT GTC CAG GCG CGG CTG CGC 1536 
Leu Thr Met Lys His, Ala Arg Cys Glu His Val Gin Ala Arg Leu Arg 
500 505 510 

TTC TCC ATC AAC TGA 1551 
Phe Ser lie Asn 
515 

(2) INFORMATION FOR SEQ ID NO : 2: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Leu Ser Gin Ser Val Pro Phe Ser Ala Thr Glu Leu Leu Leu 
15 10 15 

Ala Ser Ala lie Phe Cys Leu Val Phe Trp Val Leu Lys Gly Leu Arg 
20 25 30 

Pro Arg Val Pro Lys Gly Leu Lys Ser Pro Pro Glu Pro Trp Gly Trp 

3 5 4 0 4 5 .-;/ 

Pro Leu Leu Gly His Val Leu Thr Leu Gly Lys Asn Pro His Leu Ala 
50 55 60 

Leu Ser Arg Met Ser Gin Arg Tyr Gly Asp Val Leu Gin lie Arg lie 
65 70 75 80 

Gly Ser Thr Pro Val Leu Val Leu Ser Arg Leu Asp Thr He Arg Gin 
85 90 * 95 

Ala Leu Val Arg Gin Gly Asp Asp Phe Lys Gly Arg Pro Asp Leu Tyr 
100 * 105 ' 110 

Thr Ser Thr Leu lie Thr Asp Gly Gin Ser Leu Thr Phe Ser Thr Asp 
115 120 125 

Ser Gly Pro Val Trp Ala Ala Arg Arg Arg Leu Ala Gin Asn Ala Leu 
130 135 140 

Asn Thr Phe Ser He Ala Ser Asp Pro Ala Ser Ser Ser Ser Cys Tyr 
145 150 155 160 

Leu Glu Glu His Val Ser Lys Glu Ala Lys Ala Leu He Ser Arg Leu 
165 170 175 

Gin Glu Leu Met Ala Gly Pro Gly His Phe Asp Pro Tyr Asn Gin Val 
180 185 190 
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75 



20 



Val Val Ser Val Ala Asn Val He Gly Ala Met Cys Phe Gly Gin His 
195 200 205 

Phe Pro Glu Ser Ser Asd Glu Met Leu Ser Leu Val Lys Asn Thr His 
210 '* 215 220 

C^u ohe Val Glu Thr Ala Ser Ser Gly Asn Pro Leu Asp Phe Phe Pro 
225 230 235 240 

He Leu Arg Tyr Leu Pro Asn Pro Ala Leu Gin Arg Phe Lys Ala Phe 
245 250 255 

Asn Gin Arg Phe Leu Tn> Phe Leu Gin Lys Thr Val Gin Glu His Tyr 
260 265 270 

Gin Asp Phe Asp Lys Asn Ser Val Arg Asp He Thr Gly Ala Leu Phe 
275 280 285 

Lys His Ser Lys Lys Gly Pro Arg Ala Ser Gly Asn Leu He Pro Gin 
290 * 295 300 

•'iiu Lys He Val Asn Leu Val Asn Asp He Phe Gly Ala Gly Phe Asp 
305 310 315 320 

Thr Val Thr Thr Ala He Ser Trp Ser Leu Met Tyr Leu Val Thr Lys 
325 330 335 

Pro Glu He Gin Arg Lys He Gin Lys Glu Leu Asp Thr Val He Gly 
340 345 350 

Arg Glu Arg Arg Pro Arg Leu Ser Asp .Arg Pro Gin Leu Pro Tyr Leu 
355 360 365 

Glu Ala Phe He Leu Glu Thr Phe Arg His Ser Ser Phe Leu Pro Phe 
370 375 380 

Th^- He Pro Kis Ser Thr Thr Arg Asp Thr Thr Leu Asn Gly Phe Tyr 
35 385 390 395 400 

He Pro Lys Lys Cys Cys Val Phe Val Asn Gin Trp Gin Val Asn His 
405 410 415 

Asp Pro Glu Leu Trp Glu Asp Pro Ser Glu Phe Arg Pro Glu Arg Phe 
40 420 425 430 

Leu Thr Ala Asp Gly Thr Ala lie Asn Lys Pro Leu Ser Glu Lys Met 
435 440 445 



25 



30 



45 



50 



Met Leu Phe Gly Met Gly Lys Arg Arg Cys lie Gly Glu Val Leu Ala 
450 455 460 

Lvs T-rp Glu ~le Phe Leu Phe Leu Ala He Leu Leu Gin Gin Leu Glu 
465 470 475 480 

^he Ser Val °ro Pro Gly Val Lys Val Asp Leu Thr Pro lie Tyr Gly 

485 490 495 
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Leu Thr Met Lys His Ala Arg Cys Glu His Val Gin Ala Arg Leu Arg 
500 505 510 

Phe Ser lie Asn 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..1470 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GAT TCT ATT GTG TCC CTT GTG CTC TGT CTC TCA TGT TTG CTT CTC 
Met Asp Ser lie Val Ser Leu Val Leu Cys Leu Ser Cys Leu Leu Leu 
1 -5 10 15 

CTT TCA CTC TGG AGA CAG AGC TCT GGG AGA GGA AAA CTC CCT CCT GGC 
Leu Ser Leu Trp Arg Gin Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly 
20 25 30 

CCC ACT CCT CTC CCA GTG ATT GGA AAT ATC CTA CAG ATA GGT ATT AAG 
Pro Thr Pro Leu Pro Val He Gly Asn He Leu Gin He Gly He Lys 
35 40 45 

GAC ATC AGC AAA TCC TTA ACC AAT CTC TCA AAG GTC TAT GGC CCT GTG 
Asp He Ser Lys Ser Leu Thr Asn Leu Ser Lys Val Tyr Gly Pro Val 
50 55 60 

TTC ACT CTG TAT TTT GGC CTG AAA CCC ATA GTG GTG CTG CAT GGA TAT 
Phe Thr Leu Tyr Phe Gly Leu Lys Pro He Val Val Leu His Gly Tyr 
65 70 75 30 

GAA GCA GTG AAG GAA GCC CTG ATT GAT CTT GGA GAG GAG TTT TCT GGA 
Glu Ala Val Lys Glu Ala Leu He Asp Leu Gly Glu Glu Phe Ser Gly 
85 90 95 

AGA GGC ATT TTC CCA CTG GCT GAA AGA GCT AAC AGA GGA TTT GGA ATT 
Arg Gly lie Phe Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly lie 
100 105 HO 

GTT TTC AGC AAT GGA AAG AAA TGG AAG GAG ATC CGG CGT TTC TCC CTC 
Val Phe Ser Asn Gly Lys Lys Trp Lys Glu He Arg Arg Phe Ser Leu 
115 120 125 

ATG ACG CTG CGG AAT TTT GGG A.TG GGG AAG AGG AGC ATT GAG GAC CGT 
Met Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser He Glu Asp Arg 
130 135 " " 140 
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25 



GTT CAA GAG GAA GCC CGC TGC C7T GTG GAG GAG TTG AGA AAA ACC AAG 4 80 

Val Gin Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 
145 150 155 160 

GCC TCA CCC TGT GAT CCC ACT TTC ATC CTG GGC TGT GCT CCC TGC AAT 523 
Ala Ser Pro Cys Asp Pro Thr Phe lie Leu Gly Cys Ala Pro Cys Asn 
165 170 175 

GTG ATC TGC TCC ATT ATT TTC CAT AAA CGT TTT GAT TAT AAA GAT CAG 576 
Val lie Cys Ser He He Phe His Lys Arg Phe Asp Tyr Lys Asp Gin 
180 185 * 190 

CAA TTT CTT AAC TTA ATG GAA AAG TTG AAT GAA AAC ATC AAG ATT TTG 624 
Gin Phe Leu Asn Leu Met Glu Lys Leu Asn Glu Asn He Lys He Leu 
195 200 205 

AGC AGC CCC TGG ATC CAG ATC TGC AAT AAT TTT TCT CCT ATC ATT GAT 672 
Ser Ser Pro Tro He Gin He Cys Asn Asn Phe Ser Pro lie He Asp 
210 " 215 220 

TAC TTC CCG GGA ACT CAC AAC AAA TTA CTT AAA AAC GTT GCT TTT ATG 720 
Tyr Phe Pro Gly thr His Asn Lys Leu Leu Lys Asn Val Ala Phe Met 
225 230 235 240 

AAA AGT TAT ATT TTG GAA AAA GTA AAA GAA CAC CAA GAA TCA ATG GAC 768 
Lys Ser Tyr He Leu Glu Lys Val Lys Glu His Gin Glu Ser Met Asp 
245 250 255 

ATG AAC AAC CCT CAG. GAC TTT ATT GAT TGC TTC CTG ATG AAA ATG GAG 816 
Met Asn Asn Pro Gin Asp Phe lie Asp Cys Phe Leu Met Lys Met Glu 
260 265 270 

AAG GAA AAG CAC AAC CAA CCA TCT GAA TTT ACT ATT GAA AGC TTG GAA *' 8 64 
Lys Glu Lys His Asn Gin Pro Ser Glu Phe ,Thr He Glu Ser Leu Glu 
275 280 285 

AAC ACT GCA GTT GAC TTG TTT GGA GCT GGG AC A GAG ACG ACA AGC ACA 912 
Asn Thr Ala Val Asp Leu Phe Gly Ala Gly Thr Glu Thr Thr Ser Thr 
290 295 300 

ACC CTG AGA TAT GCT CTC CTT CTC CTG CTG AAG CAC CCA GAG GTC ACA 96 0 

Thr Leu Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 
305 310 315 320 

GCT AAA GTC CAG GAA GAG ATT GAA CGT GTG ATT GGC AGA AAC CGG AGC 1008 

Ala Lys Val Gin Glu Glu lie Glu Arg Val He Gly Arg Asn Arg Ser 

325 330 335 

40 

CCC TGC ATG CAA GAC AGG AGC CAC ATG CCC TAC ACA GAT GCT GTG GTG 10 5 6 

Pro Cys Met Gin Aso Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 

340 " 345 350 

CAC GAG GTC CAG AGA TAC ATT GAC CTT CTC CCC ACC AGC CTG CCC CAT 1104 
45 His Glu Val Gin Arg Tyr He Asp Leu Leu Pro Thr Ser Leu Pro His 
355 360 365 

GCA GTG ACC TGT GAC ATT AAA TTC AGA AAC TAT CTC ATT CCC AAG GGC 1152 

Ala Val Thr Cvs Asp He Lys Phe Arg Asn Tyr Leu He Pro Lys Gly 
370 " 375 380 

50 
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AC A ACC ATA TTA ATT TCC CTG ACT TCT GTG CTA CAT GAC AAC AAA GAA 1200 
Thr Thr lie Leu lie Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu 
385 390 395 " 400 

5 TTT CCC AAC CCA GAG ATG TTT GAC CCT CAT CAC TTT CTG GAT GAA GGT 124 8 

Phe Pro Asn Pro Glu Met Phe Aso Pro His His Phe Leu Asp Glu Gly 
405 410 415 

GGC AAT TTT AAG AAA AGT AAA TAC TTC ATG CCT TTC TCA GCA GGA AAA 12 96 

Gly Asn Phe Lys Lys Ser Lys Tyr Phe Met Pro Phe Ser Ala Gly Lys 
70 420 425 430 

CGG ATT TGT GTG GGA GAA GCC CTG GCC GGC ATG GAG CTG TTT TTA TTC 1344 
Arg He Cys Val Gly Glu Ala Leu Ala Gly Met Glu Leu Phe Leu Phe 
435 440 445 

15 CTG ACC TCC ATT TTA CAG AAC TTT AAC CTG AAA TCT CTG GTT GAC CCA 13 92 

Leu Thr Ser He Leu Gin Asn Phe Asn Leu Lys Ser Leu Val Asp Pro 
450 455 460 

AAG AAC CTT GAC ACC ACT CCA GTT GTC AAT GGA TTT GCC TCT GTG CCG 14 4 0 

Lys Asn Leu Asp Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro 
20 465 " 470 475 480 

CCC TTC TAC CAG CTG TGC TTC ATT CCT GTC TGA 14 73 

Pro Phe Tyr Gin Leu Cys Phe He Pro Val 
485 490 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 0 

(A) LENGTH: 4 90 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asp Ser He Val Ser Leu Val Leu Cys Leu Ser Cys Leu Leu Leu 
15 10 15 

Leu Ser Leu Trp Arg Gin Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly 
20 25 30 

Pro Thr Pro Leu Pro Val He Gly Asn He Leu Gin He Gly He Lys 
40 35 40 45 

Asp He Ser Lvs Ser Leu Thr Asn Leu Ser Lys Val Tyr Gly Pro Val 
50 '55 60 

Phe Thr Leu Tyr Phe Gly Leu Lys Pro He Val Val Leu His Gly Tyr 
<5 6 5 ' 70 75 80 

Glu Ala Val Lvs Glu Ala Leu He Asp Leu Gly Glu Glu Phe Ser Gly 
85 90 9o 

50 
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Arg Gly lie ?he Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly lie 
100 105 HO 

Val Phe Ser Asn Gly Lys Lys Trp Lys Glu He Arg Arg Phe Ser Leu 
5 lis ^ 120 125 

Met Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser He Glu Asp Arg 
130 135 140 

Val Gin Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 
'° 145 150 155 160 

Ala Ser Pro Cys Asp Pro Thr Phe He Leu Gly Cys Ala Pro Cys Asn 
165 170 175 

Val lie Cys Ser He He Phe His Lys Arg Phe Asp Tyr Lys Asp Gin 
15 180 185 190 

Gin Phe Leu Asn Leu Met Glu Lys Leu Asn Glu Asn He Lys He Leu 
195 . 200 205 



20 



25 



30 



Ser Ser Pro Trp He Gin He Cys Asn Asn Phe Ser Pro He He Asp 
210 215 220 

Tyr Phe Pro Gly Thr His Asn Lys Leu Leu Lys Asn Val Ala Phe Met 
225 230 235 240 

Lys Ser Tyr lie Leu Glu Lys Val Lys Glu His Gin Glu Ser Met Asp 
245 250 255 

Met Asn Asn Pro Gin Asp Phe He Asp Cys Phe Leu Met Lys Met Glu 
260 265 270 

Lys Glu Lys His Asn Gin Pro Ser Glu Phe Thr He Glu Ser Leu Glu 
275 280 285 

Asn Thr Ala Val Asp Leu Phe Gly Ala Gly Thr Glu Thr Thr Ser Thr 
290 295 300 

Thr Leu Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 
305 310 315 320 

Ala Lys Val Gin Glu Glu He Glu Arg Val He Gly Arg Asn Arg Ser 
325 330 335 

Pro Cys Met Gin Asp Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 
340 345 350 

His Glu Val Gin Arg Tyr He Aso Leu Leu Pro Thr Ser Leu Pro His 
355 360 365 

Ala Val Thr Cys Asd He Lys Phe Arg Asn Tyr Leu He Pro Lys Gly 
370 " 375 380 

Thr Thr He L^u He Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu 
385 390 395 400 

50 phe Pro Asn Pro Glu Met Phe Asp Pro His His Phe Leu Asp Glu Gly 

405 ~ 410 415 



35 
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Gly Asn Phe Lys Lys Ser Lys Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 425 430 

Arg lie Cys Val Gly Glu Ala Leu Ala Gly Met Glu Leu Phe Leu Phe 
435 , v 440 445 

Leu Thr Ser lie Leu Gin Asn Phe Asn Leu Lys Ser Leu Val Asp Pro 
450 455 460 

Lys Asn Leu Asp Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro 
465 470 475 480 

Pro Phe Tyr Gin Leu Cys Phe He Pro Val 
485 490 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 82 base pairs 

(B) TYPE: nucleic acid 

(C) . : STRANDEDNESS : double 

(D) r TOPOLOGY : linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1479 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TCT GCC CTC GGA GTC ACC GTG GCC CTG CTG GTG TGG GCG GCC TTC - 

..-jt Ser Ala Leu Gly Val Thr Val Ala Leu Leu Val Trp Ala Ala Phe 

15 10 15 

CTC CTG CTG GTG TCC ATG TGG AGG CAG GTG CAC AGC AGC TGG AAT CTG 

Leu Leu Leu Val Ser Met Trp Arg Gin Val His Ser Ser Trp Asn Leu 

20 25 30 

CCC CCA GGC CCT TTC CCG CTT CCC ATC ATC GGG AAC CTC TTC CAG TTG 

Pro Pro Gly Pro Phe Pro Leu Pro He He Gly Asn Leu Phe Gin Leu 

35 40 45 

GAA TTG AAG AAT ATT CCC AAG TCC TTC ACC CGG TTG GCC CAG CGC TTC 

Glu Leu Lys Asn He Pro Lys Ser Phe Thr Arg Leu Ala Gin Arg Phe 

50 55 60 

GGG CCG GTG TTC ACG CTG TAC GTG GGC TCG CAG CGC ATG GTG GTG ATG 

Gly Pro Val Phe Thr Leu Tyr Val Gly Ser Gin Arg Met Val Val Met 

65 70 75 80 

CAC GGC TAC AAG GCG GTG AAG GAA GCG CTG CTG GAC TAC AAG GAC GAG 

His Gly Tyr Lys Ala Val Lys Glu Ala Leu Leu Asp Tyr Lys Asp Glu 

85 90 95 

TTC TCG GGC AGA GGC GAC CTC CCC GCG TTC CAT GCG CAC AGG GAC AGG 

Phe Ser Glv Arg Gly Asp Leu Pro Ala Phe His Ala His Arg Asp Arg 

100 105 110 
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GGA ATC ATT TTT AAT AAT GGA CCT ACC TGG AAG GAC ATC CGG CGG TTT 384 
Gly lie lie Phe Asn Asn Gly Pro Thr Trp Lys As? lie Arg Arg Phe 
115 120 125 

TCC CTG ACC ACC CTC CGG AAC TAT GGG ATG GGG AAA CAG GGC AAT GAG 432 
Ser Leu Thr Thr Leu Arg Asn Tyr Gly Met Gly Lys Gin Gly Asn Glu 
130 135 140 

AGC CGG ATC CAG AGG GAG GCC CAC TTC CTG CTG GAA GCA CTC AGG AAG 48 0 

Ser Arg He Gin Arg Glu Ala His Phe Leu Leu Glu Ala Leu Arg Lys 
145 150 155 ISO 

ACC CAA GGC CAG CCT TTC GAC CCC ACC TTC CTC ATC GGG TGC GCG CCC 528 
Thr Gin Gly Gin Pro Phe Asp Pro Thr Phe Leu He Gly Cys Ala Pro 
165 170 175 

,5 TGC AAC GTC ATA GCC GAC ATC CTC TTC CGC AAG CAT TTT GAC TAC AAT 576 
Cys Asn Val He Ala Asp He Leu Phe Arg Lys His Phe Asp Tyr Asn 
180 185 190 

GAT GAG AAG TTT 'CTA AGG CTG ATG TAT TTG TTT AAT GAG AAC TTC CAC 624 
Asp Glu Lys Phe Leu Arg Leu Met Tyr Leu Phe Asn Glu Asn Phe His 
20 195 200 205 

CTA CTC AGC ACT CCC TGG CTC CAG CTT TAC AAT AAT TTT CCC AGC TTT 672 
Leu Leu Ser Thr Pro Trp Leu Gin Leu Tyr Asn Asn Phe Pro Ser Phe 
210 215 220 

25 CTA CAC TAC TTG CCT GGA AGC CAC AGA AAA GTC ATA AAA AAT GTG GCT 72 0 

Leu His Tyr Leu Pro Gly Ser His Arg Lys Val He Lys Asn Val Ala 
225 230 235 240 , ; 

GAA GTA AAA GAG TAT GTG TCT GAA AGG GTG AAG GAG CAC CAT CAA TCT 76 8 

Glu Val Lys Glu Tyr Val Ser Glu Arg Val Lys Glu His His Gin Ser 
30 245 250 255 

CTG GAC CCC AAC TGT CCC CGG GAC CTC ACC GAC TGC CTG CTC GTG GAA 816 
Leu Asp Pro Asn Cys Pro Arg Asp Leu Thr Asp Cys Leu Leu Val Glu 
260 265 270 

35 ATG GAG AAG GAA AAG CAC AGT GCA GAG CGC TTG TAC ACA ATG GAC GGT 8 64 

Met Glu Lys Glu Lys His Ser Ala Glu Arg Leu Tyr Thr Met Asp Gly 
275 280 285 

ATC ACC GTG ACT GTG GCC GAC CTG TTC TTT GCG GGG ACA GAG ACC ACC 912 
He Thr Val Thr Val Ala Asp Leu Phe Phe Ala Gly Thr Glu Thr Thr 
40 290 295 300 



45 



AGC ACA ACT CTG AGA TAT GGG CTC CTG ATT CTC ATG AAA TAC CCT GAG 
Ser Thr Thr Leu Arg Tyr Gly Leu Leu He Leu Met Lys Tyr Pro Glu 
305 310 315 320 

ATC GAA GAG A^G CTC CAT GAA GAA ATT GAC AGG GTG ATT GGG CCA AGC 
He Glu Glu Lys Leu His Glu Glu He Asp Arg Val He Gly Pro Ser 
325 330 335 



960 



1008 



50 
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CGA ATC CCT GCC ATC AAG GAT AGG CAA GAG ATG CCC TAG ATG GAT GCT 1056 
Arg lie Pro Ala lie Lys Asp Arg Gin Glu Met Pro Tyr Met Asp Ala 
340 345 350 

5 GTG GTG CAT GAG ATT CAG CGG TTC ATC ACC CTC GTG CCC TCC AAC CTG 1104 
Val Val His Glu lie Gin Arg Phe He Thr Leu Val Pro Ser Asn Leu 
355 360 365 

CCC CAT GAA GCA ACC CGA GAC ACC ATT TTC AGA GGA TAC CTC ATC CCC 1152 
Pro His Glu Ala Thr Arg Asp Thr He Phe Arg Gly Tyr Leu He Pro 
10 370 375 330 

AAG GGC ACA GTC GTA GTG CCA ACT CTG GAC TCT GTT TTG TAT GAC AAC 1200 
Lys Gly Thr Val Val Val Pro Thr Leu Asp Ser Val Leu Tyr Asp Asn 
385 390 395 400 

'5 CAA GAA TTT CCT GAT CCA GAA AAG TTT AAG CCA GAA CAC TTC CTG AAT 124 8 

Gin Glu Phe Pro Asp Pro Glu Lys Phe Lys Pro Glu Kis Phe Leu Asn 
405 410 415 

GAA AAT GGA AAG TTC AAG TAC AGT GAC TAT TTC AAG CCA TTT TCC ACA 1296 
Glu Asn Gly Lys Phe Lys Tyr Ser Asp Tyr Phe Lys Pro Phe Ser Thr 
20 4 2 0 4 2 5 4 3 0 

GGA AAA CGA GTG TGT GCT GGA GAA GGC CTG GCT CGC ATG GAG TTG TTT 1344 
Gly Lys Arg Val Cys Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe 
435 440 445 

25 CTT TTG TTG TGT GCC ATT TTG CAG CAT TTT AAT TTG AAG CCT CTC GTT 13 92 

Leu Leu Leu Cys Ala He Leu Gin His Phe Asn Leu Lys Pro Leu Val 
450 455 460 

GAC CCA AAG GAT ATC GAC CTC AGC CCT ATA CAT ATT GGG TTT GGC TGT .144 0 

Asd Pro Lys Asp He Asp Leu Ser Pro He His lie Gly Phe Gly Cys 
30 465 470 475 480 

ATC CCA CCA CGT TAC AAA CTC TGT GTC ATT CCC CGC TCA TGA 14 82 

He Pro Pro Arg Tyr Lys Leu Cys Val He Pro Arg Ser 
485 490 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Ala Leu Gly Val Thr Val Ala Leu Leu Val Trp Ala Ala Phe 
15 10 15 

Leu Leu Leu Val Ser Met Trp Arg Gin Val His Ser Ser Trp Asn Leu 
20 " 25 30 
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Pro Pro Giy Pro Phe Pro Leu Pro He He Gly Asn Leu Phe Gin Leu 
35 40 45 

Glu Leu Lys Asn He Pro Lys Ser Phe Thr Arg Leu Ala Gin Arg Phe 
50 t 55 60 

♦ 

Gly Pro Val Phe Thr Leu Tyr Val Gly Ser Gin Arg Met Val Val Met 
65 70 75 80 

His Gly Tyr Lys Ala Val Lys Glu Ala Leu Leu Asp Tyr Lys As? Glu 
85 90 95 

Phe Ser Gly Arg Gly Asp Leu Pro Ala Phe His Ala His Arg Asp Arg 
100 105 110 

Gly He He Phe Asn Asn Gly Pro Thr Trp Lys. Asp He Arg Arg Phe 
115 120 125 

Ser Leu Thr Thr Leu Arg Asn Tyr Gly Met Gly Lys Gin Gly Asn Glu 
130 135 140 

i 

Ser Arg He Gin Arg Glu Ala His Phe Leu Leu Glu Ala Leu Arg Lys 
145 150 155 160 

Thr Gin Gly Gin Pro Phe Asp Pro Thr Phe Leu lie Gly Cys Ala Pro 
165 170 175 

Cys Asn Val lie Ala Aso He Leu Phe Arg Lys His Phe Asp Tyr Asn 
180 " 185 190 

Asp Glu Lys Phe Leu Arg Leu Met Tyr Leu Phe Asn Glu Asn Phe His 
195 200 205 

Leu Leu Ser Thr Pro Trp Leu Gin Leu Tyr Asn Asn Phe Pro Ser Phe 
210 * 215 220 

Leu His Tyr Leu Pro Gly Ser His Arg Lys Val lie Lys Asn Val Ala 
225 230 235 240 

Glu Val Lys Glu Tyr Val Ser Glu Arg Val Lys Glu His His Gin Ser 
245 250 255 

Leu Asp Pro Asn Cys Pro Arg Asp Leu Thr Asp Cys Leu Leu Val Glu 
260 265 270 

Met Glu Lys Glu Lys His Ser Ala Glu Arg Leu Tyr Thr Met Asp Gly 
275 280 285 

He Thr Val Thr Val Ala Asp Leu Phe Phe Ala Gly Thr Glu Thr Thr - 
290 295 300 

Ser Thr Thr Leu Arg Tyr Gly Leu Leu lie Leu Met Lys Tyr Pro Glu 
305 310 315 320 

lie Glu Glu Lys Leu His Glu Glu He Asp Arg Val He Gly Pro Ser 
325 330 335 

Arg lie Pro Ala lie Lys Asp Arg Gin Glu Met Pro Tyr Met Asp Ala 
340 " 345 350 
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Vai Val Kis Glu He Gin Arg Phe He Thr Leu Val Pro Ser Asn Leu 

355 360 365 

Pro His Glu Ala Thr Arg Aso Thr He Phe Arg Gly Tyr Leu He Pro 

5 370 - s 37*5 380 

Lys Gly Thr Val Val Val Pro Thr Leu Asp Ser Val Leu Tyr Asp Asn 

385 390 395 400 

Gin Glu Phe Pro Asp Pro Glu Lys Phe Lys Pro Glu His Phe Leu Asn 

10 405 410 415 

Glu Asn Gly Lys Phe Lys Tyr Ser Asp Tyr Phe Lys Pro Phe Ser Thr 

420 425 430 

Gly Lys Arg Val Cys Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe 

75 435 440 445 

Leu Leu Leu Cys Ala He Leu Gin His Phe Asn Leu Lys Pro Leu Val 

450 455 460 

\sp Pro Lys Asp lie Asd Leu Ser Pro He His lie Gly Phe Gly Cys 

20 465 470 475 480 

He Pro Pro Arg Tyr Lys Leu Cys Val lie Pro Arg Ser 

485 490 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1512 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TO POLOGY : linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(3) LOCATION: 1. .1509 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG GCT CTC ATC CCA GAC TTG GCC ATG GAA ACC TGG CTT CTC CTG GCT 4 8 

Met Ala Leu He Pro Aso Leu Ala Met Glu Thr Trp Leu Leu Leu Ala 
1 5 10 15 

40 GTC AGC CTG GTG CTC CTC TAT CTA TAT GGA ACC CAT TCA CAT GGA CTT 9 6 

Val Ser Leu Val Leu Leu Tyr Leu Tyr Gly Thr His Ser His Gly Leu 
20 25 30 

TTT AAG AAG CTT GGA ATT CCA GGG CCC ACA CCT CTG CCT TTT TTG GGA 144 
Phe Lys Lys Leu Gly He Pro Gly Pro Thr Pro Leu Pro Phe Leu Gly 
45 2S 4 0 4 5 

AAT ATT TTG TCC TAC CAT AAG GGC TTT TGT ATG TTT GAC ATG GAA TGT 192 

Asn He Leu Ser Tyr His Lys Gly Phe Cys Met Phe Asp Met Glu Cys 

50 ' 55 60 

50 
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CAT AAA AAG TAT GGA AAA GTG TGG GGC TTT TAT GAT GGT CAA CAG CCT 
His Lvs Lys Tyr Gly Lys Val Trp Giy Phe Tyr Asp Gly Gin Gin Pro 
S5 ' 70 75 60 

GTG CTG GCT ATC ACA GAT CCT GAC ATG ATC AAA ACA GTG CTA GTG AAA 
Val Leu Ala lie Thr Asp Pro Asp Met He Lys Thr Val Leu Val Lys 
85 90 95 

GAA TGT TAT TCT GTC TTC ACA AAC CGG AGG CCT TTT GGT CCA GTG GGA 
Glu Cys Tyr Ser Val Phe Thr Asn Arg Arg Pro Phe Gly Pro Val Gly 
100 105 HO 

TTT ATG AAA AGT GCC ATC TCT ATA GCT GAG GAT GAA GAA TGG AAG AGA 
Phe Met Lys Ser Ala He Ser He Ala Glu Asp Glu Glu Trp Lys Arg 
115 120 125 

TTA CGA TCA TTG CTG TCT CCA ACC TTC ACC AGT GGA AAA CTC AAG GAG 
Leu Arg Ser Leu Leu Ser Pro Thr Phe Thr Ser Gly Lys Leu Lys Glu 
130 135 140 

ATG GTC CCT ATC ATT GCC CAG TAT GGA GAT GTG TTG GTG AGA AAT CTG 
Met Val Pro He He Ala Gin Tyr Gly Asp Val Leu Val Arg Asn Leu 
145 150 155 160 

AGG CGG GAA GCA GAG ACA GGC AAG CCT GTC ACC TTG AAA GAC GTC TTT 
Arg Arg Glu Ala Glu Thr Gly Lys Pro Val Thr Leu Lys Asp Val Phe 
165 170 175 

GGG GCC TAC AGC ATG GAT GTG ATC ACT AGC ACA TCA TTT GGA GTG AAC 
Gly Ala Tyr Ser Met Asp Val He Thr Ser Thr Ser Phe Gly Val Asn 
180 185 190 

ATC GAC TCT CTC AAC AAT CCA CAA GAC CCC TTT GTG GAA AAC ACC AAG 
He Asp Ser Leu Asn Asn Pro Gin Asp Pro Phe Val Glu Asn Thr Lys 
195 200 205 

AAG CTT TTA AGA TTT GAT TTT TTG GAT CCA TTC TTT CTC TCA ATA ACA 
Lys Leu Leu Arg Phe Asp Phe Leu Asp Pro Phe Phe Leu Ser He Thr 
210 215 220 

GTC TTT CCA TTC CTC ATC CCA ATT CTT GAA GTA TTA AAT ATC TGT GTG 
Val Phe Pro Phe Leu lie Pro He Leu Glu Val Leu Asn He Cys Val 
225 230 235 240 

TTT CCA AGA GAA GTT ACA AAT TTT TTA AGA AAA TCT GTA AAA AGG ATG 
Phe Pro Arg Glu Val Thr Asn Phe Leu Arg Lys Ser Val Lys Arg Met 
245 250 255 

AAA GAA AGT CGC CTC GAA GAT ACA CAA AAG CAC CGA GTG GAT TTC CTT 
Lys Glu Ser Arg Leu Glu Asp Thr Gin Lys His Arg Val Asp Phe Leu 
260 * 265 270 

CAG CTG ATG ATT GAC TCT CAG AAT TCA AAA GAA ACT GAG TCC CAC AAA 
Gin Leu Met lie Aso Ser Gin Asn Ser Lys Glu Thr Glu Ser His Lys 
275 * 230 285 
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GCT CTG TCC GAT CTG GAG CTC GTG GCC CAA TCA ATT ATC TTT ATT TTT 912 
Ala Leu Ser Asp Leu Glu Leu Val Ala Gin Ser lie lie Phe lie Phe 
290 295 300 

GCT GGC TAT GAA ACC ACG AGC AGT GTT CTC TCC TTC ATT ATG TAT GAA 960 
Ala Gly Tyr Glu Thr Thr Ser Ser Val Leu Ser Phe lie Met Tyr Glu 
305 310 315 320 

CTG GCC ACT CAC CCT GAT GTC CAG CAG AAA CTG CAG GAG GAA ATT GAT 1008 
Leu Ala Thr Kis Pro Asp Val Gin Gin Lys Leu Gin Glu Glu lie Asp 
325 330 335 

GCA GTT TTA CCC AAT AAG GCA CCA CCC ACC TAT GAT ACT GTG CTA CAG 1056 
Ala Val Leu Pro Asn Lys Ala Pro Pro Thr Tyr Asp Thr Val Leu Gin 
340 345 350 

ATG GAG TAT CTT GAC ATG GTG GTG AAT GAA ACG CTC AGA TTA TTC CCA 1104 
Met Glu Tyr Leu Asp Met Val Val Asn Glu Thr Leu Arg Leu Phe Pro 
355 " 3S0 365 

ATT GCT ATG AGA CTT GAG AGG GTC TGC AAA AAA GAT GTT GAG ATC AAT 1152 
lie Ala Met Arg Leu Glu Arg Val Cys Lys Lys Asp Val Glu lie Asn 
370 375 380 

GGG ATG TTC ATT CCC AAA GGG TGG GTG GTG ATG ATT CCA AGC TAT GCT 1200 
Gly Met Phe He Pro Lys Gly Trp Val Val Met He Pro Ser Tyr Ala 
385 390 395 400 

CTT CAC CGT GAC CCA AAG TAC TGG AC A GAG CCT GAG AAG TTC CTC CCT 124 8 

25 Leu His Arg Asp Pro Lys Tyr Trp Thr Glu Pro Glu Lys Phe Leu Pro 

405 " 410 415 

GAA AGA TTC AGC AAG AAG AAC AAG GAC AAC ATA GAT CCT TAC ATA TAC '*' 1296 

Glu Arg Phe Ser Lys Lys Asn Lys Asp Asn He Asp Pro Tyr He Tyr 

420 425 1 430 

30 

AC A CCC TTT GGA AGT GGA CCC AGA AAC TGC ATT GGC ATG AGG TTT GCT 1344 

Thr Pro Phe Gly Ser Gly Pro Arg Asn Cys lie Gly Met Arg Phe Ala 
435 440 445 

CTC ATG AAC ATG AAA CTT GCT CTA ATC AGA GTC CTT CAG AAC TTC TCC 13 92 

Leu Met Asn Met Lys Leu Ala Leu He Arg Val Leu Gin Asn Phe Ser 
450 455 460 

TTC AAA CCT TGT AAA GAA ACA CAG ATC CCC CTG AAA TTA AGC TTA GGA 14 4 0 

Phe Lys Pro Cys Lys Glu Thr Gin lie Pro Leu Lys Leu Ser Leu Gly 
465 470 475 480 

40 GGA CTT CTT CAA CCA GAA AAA CCC GTT GTT CTA AAG GTT GAG TCA AGG 14 88 

Gly Leu Leu Gin Pro Glu Lys Pro Val Val Leu Lys Val Glu Ser Arg 
485 490 495 

GAT GGC ACC GTA AGT GGA GCC TGA ' 1512 

Asp Gly Thr Val Ser Gly Ala 
4o 500 



50 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 

Met Ala Leu lie Pro Asp Leu Ala Met Glu Thr Trp Leu Leu Leu Ala 
15 10 15 

Val Ser Leu Val Leu Leu Tyr Leu Tyr Gly Thr His Ser His Gly Leu 
20 25 30 

Phe Lys Lys Leu Gly lie Pro Gly Pro Thr Pro Leu Pro Phe Leu Gly 
35 40 45 

Asn lie Leu Ser Tyr His Lys Gly Phe Cys Met Phe Asp Met Glu Cys 
50 55 60 

His Lys Lys Tyr Gly Lys Val Trp Gly Phe Tyr Asp Gly Gin Gin Pro 
65 70 75 80 

Val Leu Ala lie Thr Asp Pro Asp Met lie Lys Thr Val Leu Val Lys 
25 8 5 9 0 9 5 

Glu Cys Tyr Ser Val Phe Thr Asn Arg Arg Pro Phe Gly Pro Val Gly 
100 105 110 

Phe Met Lys Ser Ala lie Ser lie Aid Glu Asp Glu Glu Tro Lys Arg 
30 115 120 125 

Leu Arg Ser Leu Leu Ser Pro Thr Phe Thr Ser Gly Lys Leu Lys Glu 
130 135 140 



20 



35 



Met Val Pro lie lie Ala Gin Tyr Gly Asp Val Leu Val Arg Asn Leu 

145 150 155 160 

Arg Ara Glu Ala Glu Thr Gly Lys Pro Val Thr Leu Lys Asp Val Phe 
165 170 175 

4Q Gly Ala Tyr Ser Met Asp Val lie Thr Ser Thr Ser Phe Gly Val Asn 

180 185 190 

lie Asp Ser Leu Asn Asn Pro Gin Asp Pro Phe Val Glu Asn Thr Lys. 

1S5 200 205 

45 Lys Leu Leu Arg Phe Asp Phe Leu Asp Pro Phe Phe Leu Ser He Thr 

210 * 215 220 

Val Phe Pro Phe Leu He Pro He Leu Glu Val Leu Asn He Cys Val 

225 230 235 240 

50 phe Pro Arg Glu Val Thr Asn Phe Leu Arg Lys Ser Val Lys Arg Met 

245 250 255 
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Lys Glu Ser Arg Leu Glu Asp Thr Gin Lys His Arg Val Asp Phe Leu 
260 265 270 

Gin Leu Met He Asp Ser Gin Asn Ser Lys Glu Thr Glu Ser His Lys 
275 280 285 

Ala Leu Ser Asp Leu Glu Leu Val Ala Gin Ser He He Phe He Phe 
290 295 300 

Ala Gly Tyr Glu Thr Thr Ser Ser Val Leu Ser Phe lie Met Tyr Glu 
305 310 315 320 

Leu Ala Thr His Pro Asp Val Gin Gin Lys Leu Gin Glu Glu He Asp 
325 330 335 

Ala Val Leu Pro Asn Lys Ala Pro Pro Thr Tyr Asp Thr Val Leu Gin 
340 345 350 

Met Glu Tyri Leu Asp Met Val Val Asn Glu Thr Leu Arg Leu Phe Pro 
355 360 365 

He Ala Met Arg Leu Glu Arg Val Cys Lys Lys Asp Val Glu He Asn 
370 375 380 

Gly Met Phe He Pro Lys Gly Trp Val Val Met He Pro Ser Tyr Ala 
385 390 395 400 

Leu His Arg Asp Pro Lys Tyr Trp Thr Glu Pro Glu Lys Phe Leu Pro 
405 410 '415 

Glu Arg Phe Ser Lys Lys Asn Lys Asp Asn lie Asd Pro Tyr lie Tyr 
420 425 " 430 

Thr Pro Phe Gly Ser Gly Pro Arg Asn Cys lie Gly Met Arg Phe Ala 
435 440 445 

35 Leu Met Asn Met Lys Leu Ala Leu He Arg Val Leu Gin Asn Phe Ser 
450 455 460 

Phe Lys Pro Cys Lys Glu Thr Gin He Pro Leu Lys Leu Ser Leu Gly 
465 470 475 480 



20 
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40 



45 



50 



Gly Leu Leu Gin Pro Glu Lys Pro Val Val Leu Lys Val Glu Ser Arg 
485 490 495 

Asp Gly Thr Val Ser Gly Ala 
500 

(2) INFORMATION FOR SSQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ix) FEATURE : 

(A) NAM 5 /KEY : CDS 

(B) LOCATION: 1..1536 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CTT TTC CCA ATC TCC ATG TCG GCC ACG GAG TTT CTT CTG GCC TCT 4 8 

Met Leu Phe Pro He Ser Met Ser Ala Thr Glu Phe Leu Leu Ala Ser 
15 10 15 

GTC ATC TTC TGT CTG GTA TTC TGG GTA ATC AGG GCC TCA AGA CCT CAG 9S 
Val He Phe Cys Leu Val Phe Trp Val He Axg Ala Ser Arg Pro Gin 
20 25 30 

GTC CCC AAA GGC CTG AAG AAT CCA CCA GGG CCA TGG GGC TGG CCT CTG 144 
/5 Val Pro Lys Gly Leu Lys Asn Pro Pro Gly Pro Trp Gly TrD Pro Leu 
35 40 45 

ATT GGG CAC ATG CTG ACC CTG GGA AAG AAC CCG CAC CTG GCA CTG TCA 192 

He Gly His Met Leu Thr Leu Gly Lys Asn Pro His Leu Ala Leu Ser 
50 *: 55 60 

20 

AGG ATG AGC CAG CAG TAT GGG GAC GTG CTG CAG ATC CGA ATT GGC TCC 24 0 

Arg Met Ser Gin Gin Tyr Gly Asp Val Leu Gin He Arg He Gly Ser 
65 70 75 80 

ACA CCC GTG GTG GTG CTG AGC GGC CTG GAC ACC ATC CGG CAG GCC CTG 288 
Thr Pro Val Val Val Leu Ser Gly Leu Asp Thr lie Arg Gin Ala Leu 
85 90 95 

GTG CGG CAG GGC GAT GAT TTC AAG GGC CGG CCC GAC CTC TAC ACC TTC 3 36 

Val Arg Gin Gly Asp Asp Phe Lys Gly Arg Pro Asp Leu Tyr Thr Phe 
100 105 . 110 

ACC CTC ATC AGT AAT GGT CAG AGC ATG TCC TTC AGC CCA GAC TCT GGA 3 84 

Thr Leu He Ser Asn Gly Gin Ser Met Ser Phe Ser Pro Asp Ser Gly 
115 120 125 

CCA GTG TGG GCT GCC CGC CGG CGC CTG GCC CAG AAT GGC CTG AAA AGT 43 2 

Pro Val Trp Ala Ala Arg Arg Arg Leu Ala Gin Asn Gly Leu Lys Ser *" 
130 135 140 

TTC TCC ATT GCC TCT GAC CCA GCC TCC TCA ACC TCC TGC TAC CTG GAA 4 80 

Phe Ser He Ala Ser Asp Pro Ala Ser Ser Thr Ser Cys Tyr Leu Glu 
145 150 155 160 

GAG CAT GTG AGC AAG GAG GCT GAG GTC CTG ATA AGC ACG TTG CAG GAG 52 8 

Glu His Val Ser Lys Glu Ala Glu Val Leu He Ser Thr Leu Gin Glu 
165 170 175 

CTG ATG GCA GGG CCT GGG CAC TTT AAC CCC TAC AGG TAT GTG GTG GTA 576 
Leu Met Ala Gly Pro Gly His Phe Asn Pro Tyr Arg Tyr Val Val Val 
_, 45 ~ 180 185 * 190 

TCA GTG ACC AAT GTC ATC TGT GCC ATT TGC TTT GGC CGG CGC TAT GAC 624 

Ser Val Thr Asn Val He Cys Ala He Cys Phe Gly Arg Arg Tyr Aso 

195 200 205 

50 
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CAC AAC CAC CAA GAA CTG CTT AGC CTA GTC AAC CTG AAT AAT AAT TTC 672 
Kis Asa His Gin Glu Leu Leu Ser Leu Val Asn Leu Asa Asn Asn Phe 
210 215 220 

GGG GAG GTG GTT GGC TCT GGA AAC CCA GCT GAC TTC ATC CCT ATT CTT 720 
Gly Glu Val Val Gly* Ser Gly Asn Pro Ala Asp Phe lie Pro He Leu 
225 '230 235 240 

CGC TAC CTA CCC AAC CCT TCC CTG AAT GCC TTC AAG GAC CTG AAT GAG 768 
Arg Tyr Leu Pro Asn Pro Ser Leu Asn Ala Phe Lys Asd Leu Asn Glu 
245 250 * 255 

AAG TTC TAC AGC TTC ATG CAG AAG ATG GTC AAG GAG CAC TAC AAA ACC 816 
Lys Phe Tyr Ser Phe Met Gin Lys Met Val Lys Glu His Tyr Lys Thr 
260 265 270 

TTT GAG AAG GGC CAC ATC CGG GAC ATC ACA GAC AGC CTG ATT GAG CAC 864 
Phe Glu Lys Gly His He Arg Asp He Thr Asp Ser Leu He Glu His 
275 280 285 

TGT CAG GAG AAG CAG CTG GAT GAG AAC GCC AAT GTC CAG CTG TCA GAT 912 
Cys Gin Glu Lys\ Gin Leu Asp Glu Asn Ala Asn Val Gin Leu Ser Asp 
290 * 295 300 

GAG AAG ATC ATT AAC ATC GTC TTG GAC CTC TTT GGA GCT GGG TTT GAC 960 
Glu Lys He He Asn He Val Leu Aso Leu Phe Gly Ala Gly Phe Asp 
305 310 * 315 320 

ACA GTC ACA ACT GCT ATC TCC TGG AGC CTC ATG TAT TTG GTG ATG AAC 1008 
Thr Val Thr Thr Ala He Ser Trp Ser Leu Met Tyr Leu Val Met Asn 
325 330 335 

CCC AGG GTA CAG AGA AAG ATC CAA GAG GAG CTC GAC ACA GTG ATT GGC 1056 
Pro Arg Val Gin Arg Lys He Gin Glu Glu Leu Asp Thr Val He Gly 
340 345 1 350 

AGG TCA CGG CGG CCC CGG CTC TCT GAC AGA TCC CAT CTG CCC TAT ATG 1104 
Arg Ser Arg Arg Pro Arg Leu Ser Asp Arg Ser His Leu Pro Tyr Met 
355 360 365 

GAG GCC TTC ATC CTG GAG ACC TTC CGA CAC TCT TCC TTC GTC CCC TTC " - 1152 
Glu Ala Phe He Leu Glu Thr Phe Arg His Ser Ser Phe Va"> Pro Phe 
370 375 380 

ACC ATC CCC CAC AGC ACA ACA AGA GAC ACA AGT TTG AAA GGC TTT TAC 12 00 

Thr He Pro His Ser Thr Thr Arg Asp Thr Ser Leu Lys Gly Phe Tyr 
385 390 395 400 

ATC CCC AAG GGG CGT TGT GTC TTT GTA AAC CAG TGG CAG ATC AAC CAT 124 8 

He Pro Lys Gly Arg Cys Val Phe Val Asn Gin Trp Gin He Asn His 
405 410 415 

GAC CAG AAG CTA TGG GTC AAC CCA TCT GAG TTC CTA CCT GAA CGG TTT 12 96 

Asp Gin Lys Leu Trp Val Asn Pro Ser Glu Phe Leu Pro Glu Ara Phe 
420 425 430 

CTC ACC CCT GAT GGT GCT ATC GAC AAG GTG TTA AGT GAG AAG GTG ATT 1344 
Leu Thr Pro Asp Gly Ala He Asp Lys Val Leu Ser Glu Lvs Val lie 
435 440 445 
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ATC TTT GGC ATG GGC AAG CGG AAG TGT ATC GGT GAG ACC ATT GCC AGC 13 92 

He Phe Gly Met Gly Lys Arg Lys Cys He Gly Glu Thr He Ala Ser 
450 455 460 

TGG GAG GTC TTT CTC TTC CTG GCT ATC CTG CTG CAA CGG GTG GAA TTC 144 0 

Trp Glu Val Phe Leu Phe Leu Ala He Leu Leu Gin Arg Val Glu Phe 
465 470 475 480 

AGC GTG CCA CTG GGC GTG AAG GTG GAC ATG ACC CCC ATC TAT GGG CTA 1488 
Ser Val Pro Leu Gly Val Lys Val Asp Met Thr Pro He Tyr Gly Leu 
485 " 490 495 

ACC ATG AAG CAT GCC TGC TGT GAG CAC TTC CAA ATG CAG CTG CGC TCT 1536 
Thr Met Lys His Ala Cys Cys Glu His Phe Gin Met Gin Leu Arg Ser 
500 505 510 

TAG 153 9 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 512 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Leu Phe Pro He Ser Met Ser Ala Thr Glu Phe Leu Leu Ala Ser 
15 10 15 

Val He Phe Cys Leu Val Phe Trp Val He Arg Ala Ser Arg Pro Gin 
20 25 » 30 

Val Pro Lys Gly Leu Lys Asn Pro Pro Gly Pro Trp Gly Trp Pro Leu 
35 40 45 

He Gly His Met Leu Thr Leu Gly Lys Asn Pro His Leu Ala Leu Ser 
50 55 60 

Arg Met Ser Gin Gin Tyr Gly Asp Val Leu Gin He Arg He Gly Ser 
65 70 75 80 

Thr Pro Val Val Val Leu Ser Gly Leu Asp Thr He Arg Gin Ala Leu 
85 90 95 

Val Arg Gin Gly Asp Asp Phe Lys Gly Arg Pro Asp Leu Tyr Thr Phe 
100 " 105 110 

Thr Leu He Ser Asn Gly Gin Ser Met Ser Phe Ser Pro Aso Ser Gly 
115 120 125 

Pro Val Trp Ala Ala Arg Arg Arg Leu Ala Gin Asn Gly Leu Lys Ser 
130 135 140 
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Phe Ser He Ala Ser Asp Pro Ala Ser Ser Thr Ser Cys Tyr Leu Glu 
145 150 155 160 

Glu Kis Val Ser Lys Glu Ala Glu Val Leu He Ser Thr Leu Gin Glu 
; 165 170 175 

Leu Met Ala Gly Pro Gly His Phe Asn Pro Tyr Arg Tyr Val Val Val 
180 185 190 

Ser Val Thr Asn Val He Cys Ala He Cys Phe Gly Arg Arg Tyr Aso 
195 200 205 

His Asn His Gin Glu Leu Leu Ser Leu Val Asn Leu Asn Asn Asn Phe 
210 215 220 

Gly Glu Val Val Gly Ser Gly Asn Pro Ala Asp Phe He Pro lie Leu 
225 230 235 240 

Arg Tyr Leu Pro Asn Pro Ser Leu Asn Ala Phe Lys Asp Leu Asn Glu 
: 245 250 255 . 

i 

Lys Phe Tyr Ser Phe Met Gin Lys Met Val Lys Glu His Tyr Lys Thr 
260 265 270 

Phe Glu Lys Gly His lie Arg Asd lie Thr Asp Ser Leu lie Glu His 
275 280 285 

Cys Gin Glu Lys Gin Leu Asp Glu Asn Ala Asn Val Gin Leu Ser Asp 
290 295 300 

Glu Lys He lie Asn lie Val Leu Asp Leu Phe Gly Ala Glv Phe' ■•Asp 
305 310 315 * 320 

Thr Val Thr Thr Ala He Ser Trp Ser Leu Met Tyr Leu Val Met Asn 
325 330 335 

Pro Arg Val Gin Arg Lys lie Gin Glu Glu Leu Asp Thr Val lie Gly 
340 345 350 

Arg Ser Arg Arg Pro Arg Leu Ser Asp Arg Ser His Leu Pro Tyr Met 
355 360 365 

Glu Ala Phe He Leu Glu Thr Phe Arg His Ser Ser Phe Val Pro Phe 
370 375 380 

Thr lie Pro His Ser Thr Thr Arg Asp Thr Ser Leu Lys Gly Phe Tyr 
385 390 395 400 

lie Pro Lys Gly Arg Cys Val Phe Val Asn Gin Trp Gin lie Asn His 
405 410 415 

Asp Gin Lys Leu Trp Val Asn Pro Ser Glu Phe Leu Pro Glu Arg Phe 
420 425 430 



Leu Thr Pro Asp Gly Ala lie Asp Lys Val Leu Ser Glu Lvs Val lie 

435 4 40 44 5 ^ 

lie Phe Gly Met Gly Lys Arg Lys Cys lie Gly Glu Thr lie Ala Ser 

450 455 460 
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Trp Glu Val Phe Leu Phe Leu Ala He Leu Leu Gin Arg Val Glu Phe 

455 470 475 48C 

Ser Val Pro Leu Gly Val Lys Val Asp Met Thr Pro He Tyr Gly Leu 

485. 490 495 



Thr Met Lys His Ala Cys Cys Glu His Phe Gin Met Glr. Leu Arg Ser 
500 ■ 505 510 



10 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
;s (D) TOPOLOGY: linear 
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45 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .1536 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATG CTT TTC CCA ATC TCC ATG TCG GCC ACG GAG TTT CTT CTG GCC TCT 4 8 

Net Leu Phe Pro He Ser Met Ser Ala Thr Glu Phe Leu Leu Ala Ser 

15 10 15 

GTC ATC TTC TGT CTG GTA TTC TGG GTA ATC AGG GCC TCA AGA CCT CAG 96 

Val He Phe Cys Leu Val Phe Trp Val He Arg Ala Ser Arg Pro Gin 

20 25 30 0 

GTC CCC AAA GGC CTG AAG AAT CCA CCA GGG \CCA TGG GGC TGG CCT CTG 144 

Val Pro Lys Gly Leu Lys Asn Pro Pro Gly Pro Trp Gly Trp Pro Leu 
35 40 45 

ATT GGG CAC ATG CTG ACC CTG GGA AAG AAC CCG CAC CTG GCA CTG TCA 192 

He Gly His Met Leu Thr Leu Gly Lys Asn Pro His Leu Ala Leu Ser 
50 55 60 

35 

AGG ATG AGC CAG CAG TAT GGG GAC GTG CTG CAG ATC CGA ATT GGC TCC 24 0 

Arg Met Ser Gin Gin Tyr Gly Asp Val Leu Gin He Arg lie Gly Ser 

65 70 75 SO 

AC A CCC GTG GTG GTG CTG AGC GGC CTG GAC ACC ATC CGG CAG GCC CTG 288 

40 Thr Pro Val Val Val Leu Ser Gly Leu Asp Thr He Arg Gin Ala Leu 

85 90 95 

GTG CGG CAG GGC GAT GAT TTC AAG GGC CGG CCC GAC CTC TAC ACC TTC 33 6 

Val Arg Gin Gly Asp Asp Phe Lys Gly Arg Pro Asp Leu Tyr Thr Phe 

100 105 110 



ACC CTC ATC AGT AAT GGT CAG AGC ATG TCC TTC AGC CCA GAC TCT GGA 384 
Thr Leu He Ser Asn Gly Gin Ser Met Ser Phe Ser Pro Asp Ser Gly 
115 120 125 
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CCA GTG TGG GCT GCC CGC CGG CGC CTG GCC CAG AAT GGC CTG AAA AGT 432 
Pro Val Trp Ala Ala Arg Arg Arg Leu Ala Gin Asn Gly Leu Lys Ser 
130 135 140 

5 TTC TCC ATT GCC TCT GAC CCA GCC TCC TCA ACC TCC TGC TAC CTG GAA 4 80 

Phe Ser lie Ala Ser Asp Pro Ala Ser Ser Thr Ser Cys Tyr Leu Glu 
145 150 155 160 

GAG CAT GTG AGC AAG GAG GCT GAG GTC CTG ATA AGC ACG TTG CAG GAG 528 
Glu His Val Ser Lys Glu Ala Glu Val Leu lie Ser Thr Leu Gin Glu 
J 0 165 170 175 

■ CTG ATG GCA GGG CCT GGG CAC TTT AAC CCC TAC AGG TAT GTG GTG GTA 576 
Leu Met Ala Gly Pro Gly His Phe Asn Pro Tyr Arg Tyr Val Val Val 
180 185 190 

15 TCA GTG ACC AAT GTC ATC TGT GCC ATT TGC TTT GGC CGG CGC TAT GAC 624 
Ser Val Thr Asn Val lie Cys Ala lie Cys Phe Gly Arg Arg Tyr Asp 
195 200 205 

CAC AAC CAC CAA GAA CTG CTT AGC CTA GTC AAC CTG AAT AAT AAT TTC • 672 
His Asn His Gin Glu Leu Leu Ser Leu Val Asn Leu Asn Asn Asn Phe 
20 210 215 220 

GGG GAG GTG GTT GGC TCT GGA AAC CCA GCT GAC TTC ATC CCT ATT CTT 720 
Gly Glu Val Val Gly Ser Gly Asn Pro Ala Asp Phe lie Pro lie Leu 
225 230 235 240 
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CGC TAC CTA CCC AAC CCT TCC CTG AAT GCC TTC AAG GAC CTG AAT GAG 768 
Arg Tyr Leu Pro Asn Pro Ser Leu Asn Ala Phe Lys Asp Leu Asn Glu 
245 250 255 

AAG TTC TAC AGC TTC ATG CAG AAG ATG GTC AAG GAG CAC TAC AAA ACC " .816 
Lys Phe Tyr Ser Phe Met Gin Lys Met Val Lys Glu His Tyr Lys Thr 
260 265 1 270 

TTT GAG AAG GGC CAC ATC CGG GAC ATC ACA GAC AGC CTG ATT GAG CAC 864 
Phe Glu Lys Gly His lie Arg Asp lie Thr Asp Ser Leu He Glu His 
275 280 285 

TGT CAG GAG AAG CAG CTG GAT GAG AAC GCC AAT GTC CAG CTG TCA GAT " 912 
Cys Gin Glu Lys Gin Leu Asp Glu Asn Ala Asn Val Gin Leu Ser Asp 
290 295 300 

GAG AAG ATC ATT AAC ATC GTC TTG GAC CTC TTT GGA GCT GGG TTT GAC 960 
Glu Lys He He Asn He Val Leu Asp Leu Phe Gly Ala Gly Phe Asp 
305 " 310 315 320 

ACA GTC ACA ACT GCT ATC TCC TGG AGC CTC ATG TAT TTG GTG ATG AAC 10 08 

Thr Val Thr Thr Ala lie Ser Trp Ser Leu Met Tyr Leu Val Met Asn 
325 330 335 

CCC AGG GTA CAG AGA AAG ATC CAA GAG GAG CTC GAC ACA GTG ATT GGC 1056 
Pro Arc Val Gin Arg Lys He Gin Glu Glu Leu Asp Thr Val He Gly 
340 345 350 

AGG TCA CGG CGG CCC CGG CTC TCT GAC AGA TCC CAT CTG CCC TAT ATG 1104 
Arg Ser Arg Arg Pro Arg Leu Ser Asp Arg Ser His Leu Pro Tyr Met 
355 360 365 
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GAG GCC TTC ATC CTG GAG ACC TTC CGA CAC TCT TCC TTC GTC CCC TTC 1152 
GIu Ala Phe lie Leu Glu Thr Phe Arc His Ser Ser Phe Val Pro Phe 
370 375 380 

5 ACC ATC CCC CAC AGC • ACA ACA AGA GAC ACA AGT TTG AAA GGC TTT TAC 1200 
Thr lie Pro His Ser Thr Thr Arg Asp Thr Ser Leu Lys Gly Phe Tyr 
335 390 395 400 

ATC CCC AAG GGG CGT TGT GTC TTT GTA AAC CAG TGG CAG ATC AAC CAT 1248 
lie Pro Lys Gly Arg Cys Val Phe Val Asn Gin Trp Gin lie Asn His 
10 405 410 415 

GAC CAG AAG CTA TGG GTC AAC CCA TCT GAG TTC CTA CCT GAA CGG TTT 1296 
Asp Gin Lys Leu Trp Val Asn Pro Ser Glu Phe Leu Pro Glu Arg Phe 
420 425 430 

/5 CTC ACC CCT GAT GGT GCT ATC GAC AAG GTG TTA AGT GAG AAG GTG ATT 1344 
Leu Thr Pro Asp Gly Ala He Asp Lys Val Leu Ser Glu Lys Val He 
435 440 445 

ATC TTT GGC ATG : GGC AAG CGG AAG TGT ATC GGT GAG ACC ATT GCC CGC - 13 92 
He Phe Gly Met Gly Lys Arg Lys Cys He Gly Glu Thr He Ala Arg 
20 450 455 460 

TGG GAG GTC TTT CTC TTC CTG GCT ATC CTG CTG CAA CGG GTG GAA TTC 1440 
Trp Glu Val Phe Leu Phe Leu Ala lie Leu Leu Gin Arg Val Glu Phe 
465 470 475 480 

25 AGC GTG CCA CTG GGC GTG AAG GTG GAC ATG ACC CCC ATC TAT GGG CTA 14 88 

Ser Val Pro Leu Gly Val Lys Val Asp Met Thr Pro He Tyr Gly Leu 
485 490 495 • 

ACC ATG AAG CAT GCC TGC TGT GAG CAC TTC CAA ATG CAG CTG CGC TCT 1536 
Thr Met Lys His Ala Cys Cys Glu His Phe . Gin Met Gin Leu Arg Ser 
500 505 1 510 
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TAG 1539 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Leu Phe Pro He Ser Met Ser Ala Thr Glu Phe Leu Leu Ala Ser 1 
15 10 15 

Val He Phe Cys Leu Val Phe Trp Val He Arg Ala Ser Arc Pro Gin 
20 25 30 

Val Pro Lys Gly Leu Lys Asn Pro Pro Gly Pro Trp Gly Trp Pro Leu 
35 40 45 
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lie Gly His Met Leu Thr Leu Glv Lys Asn Pro 
50 55 

Arg Met Ser Gin Gin Tyr Gly Asp Val Leu Gin 
65 V 70 75 

Thr Pro Val Val Val Leu Ser Gly Leu Asp Thr 
85 90 

Val Arg Gin Gly Asp Asp Phe Lys Gly Arg Pro 
100 105 

Thr Leu lie Ser Asn Gly Gin Ser Met Ser Phe 
115 120 

Pro Val Trp Ala Ala Arg Arg Arg Leu Ala Gin 
130 135 

Phe Ser lie Ala Ser Asp Pro Ala Ser Ser Thr 
145 \ 150 155 

Glu His Val Ser Lys Glu Ala Glu Val Leu He 
165 170 

Leu Met Ala Gly Pro Gly His Phe Asn Pro Tyr 
180 185 

Ser Val Thr Asn Val He Cys Ala He Cys Phe 
195 200 

His Asn His Gin Glu Leu Leu Ser Leu Val Asn 
210 215 

Gly Glu Val Val Gly Ser Gly Asn Pro Ala Asp 
225 230 235 



His Leu Ala Leu Ser 
60 

He Arg lie Gly Ser 
80 

He Arg Gin Ala Leu 
95 

Asp Leu Tyr Thr Phe 
110 

Ser Pro Asp Ser Gly 
125 

Asn Gly Leu Lys Ser 
140 

Ser Cys Tyr Leu- Glu 
160 

Ser Thr Leu Gin Glu 
175 

Arg Tyr Val Val Val 
190 

Gly Arg Arg Tyr Asp 
205 

Leu Asn Asn Asn Phe 
220 

Phe He Pro lie Leu. 

240 
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Arg Tyr Leu Pro Asn Pro Ser Leu Asn Ala Phe 
245 250 

Lys Phe Tyr Ser Phe Met Gin Lys Met Val Lys 
260 265 

Phe Glu Lys Gly His lie Arg Asp lie Thr Asp 
275 280 

Cys Gin Glu Lys Gin Leu Asp Glu Asn Ala Asn 
290 295 



Lys Asp Leu Asn Glu 
255 

Glu His Tyr Lys Thr 
270 

Ser Leu He Glu His 
285 

Val Gin Leu Ser Asp 
300 



45 Glu Lys lie lie Asn lie Val Leu Asp Leu Phe 

305 310 315 



Gly Ala Gly Phe Asp 
320 



50 



Thr Val Thr Thr Ala lie Ser Trp Ser Leu Met 
325 * 330 

Pro Arg Val Gin Arg Lys He Gin Glu Glu Leu 
340 345 



Tyr Leu Val Met Asn 
335 

Asp Thr Val lie Gly 
350 
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Arg Ser Arg Arg Pro Arg Leu Ser Asp Arg Ser His Leu Pro Tyr Met 

355 3S0 365 

Glu Ala Phe He Leu Glu Thr Phe Arg His Ser Ser Phe Val Pro Phe 
5 370 375 380 

Thr He Pro His Ser Thr Thr Arg Asp Thr Ser Leu Lys Gly Phe Tyr 
385 390 ' 395 400 

He Pro Lys Gly Arg Cys Val Phe Val Asa Gin Trp Gin He Asn His 
J0 405 410 415 

Asp Gin Lys Leu Trp Val Asn Pro Ser Glu Phe Leu Pro Glu Arg Phe 
420 425 430 
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Leu Thr Pro Asp Gly Ala He Asp Lys Val Leu Ser Glu Lys Val He 
435 440 445 

He Phe Gly Met Gly Lys Arg Lys Cys He Gly Glu Thr He Ala Arg 
450 455 460 

Trp Glu Val Phe ieu Phe Leu Ala lie Leu Leu Gin Arg Val Glu Phe 
-65 4 470 475 480 

Ser Val Pro Leu Gly Val Lys Val Asp Met Thr Pro He Tyr Gly Leu 
485 490 495 

Thr Met Lys Kis Ala Cys Cys Glu His Phe Gin Met Gin Leu Arg Ser 
500 505 510 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 9 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 
35 (A) NAME/KEY: CDS 

(E) LOCATION: 1. .1536 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

40 ATG CTT TTC CCA ATC TCC ATG TCG GCC ACG GAG TTT CTT CTG GCC TCT 4 8 

Met Leu Phe Pro He Ser Met Ser Ala Thr Glu Phe Leu Leu Ala Ser 
15 10 15 

GTC ATC TTC TGT CTG GTA TTC TGG GTA ATC AGG GCC TCA AGA CCT CAG ' 9 6 

Val lie Phe Cys Leu Val Phe Trp Val He Arg Ala Ser Arg Pro Gin 
45 20 25 30 

GTC CCC AAA GGC CTG AAG AAT CCA CCA GGG CCA TGG GGC TGG CCT CTG 14 4 

Val Pro Lys Gly Leu Lys Asn Pro Pro Gly Pro Trp Gly Trp Pro Leu 

35 40 45 

50 
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ATT GGG CAC ATG CTG ACC CTG GGA AAG AAC CCG CAC CTG GCA CTG TCA 192 
He Gly His Met Leu Thr Leu Gly Lys Asn Pro His Leu Ala Leu Ser 
50 55 60 

AGG ATG AGC CAG CAG TAT GGG GAC GTG CTG CAG ATC CGA ATT GGC TCC 24 0 

Arg Met Ser Gin Gin, Tyr Gly Asp Val Leu Gin He Arg He Gly Ser 
65 1 70 75 80 

ACA CCC GTG GTG GTG CTG AGC GGC CTG GAC ACC ATC CGG CAG GCC CTG 288 
Thr Pro Val Val Val Leu Ser Gly Leu Asd Thr He Arg Gin Ala Leu 
85 90 95 

•GTG CGG CAG GGC GAT GAT TTC AAG GGC CGG CCC GAC CTC TAC ACC TTC 336 
Val Arg Gin Gly Asp Asp Phe Lys Gly Arg Pro Aso Leu Tyr Thr Ph» 
100 105 * no 

ACC CTC ATC AGT AAT GGT CAG AGC ATG TCC TTC AGC CCA GAC TCT GGA 3 84 

Thr Leu He Ser Asn Gly Gin Ser Met Ser Phe Ser Pro Asp Ser GW 
H5 120 125 

CCA GTG TGG GCT GCC CGC CGG CGC CTG GCC CAG AAT GGC CTG AAA AGT 4 32 

Pro Val Trp Ala Ala Arg Arg Arg Leu Ala Gin Asn Gly Leu Lys Ser 
130 ' 135 140 



TTC TCC ATT GCC TCT GAC CCA GCC TCC TCA ACC TCC TGC TAC CTG GAA 
Phe Ser He Ala Ser Asp Pro Ala Ser Ser Thr Ser Cys Tyr Leu Glu 
145 150 155 160 



480 



GAG CAT GTG AGC AAG GAG GCT GAG GTC CTG ATA AGC ACG TTG CAG GAG 52 8 

Glu His Val Ser Lys Glu Ala Glu Val Leu He Ser Thr Leu Gin Glu 
165 170 175 

CTG ATG GCA GGG CCT GGG CAC TTT AAC CCC TAC AGG TAT GTG GTG GTA 576 
Leu Met Ala Gly Pro Gly His Phe Asn Pro Tyr Arg Tyr Val Val Val 
180 185 ; 190 

TCA GTG ACC AAT GTC ATC TGT GCC ATT TGC TTT GGC CGG CGC TAT GAC 6 24 

Ser Val Thr Asn Val He Cys Ala He Cys Phe Gly Arg Arq Tyr Pso 
195 200 205 

CAC AAC CAC CAA GAA CTG CTT AGC CTA GTC AAC CTG AAT AAT AAT TTC * 672 
His Asn His Gin Glu Leu Leu Ser Leu Val Asn Leu Asn Asn Asn ?h* 
210 215 220 

GGG GAG GTG GTT GGC TCT GGA AAC CCA GCT GAC TTC ATC CCT ATT C^T 720 
Gly Glu Val Val Gly Ser Gly Asn Pro Ala Asp Phe He Pro lie Leu 
225 230 235 240 

CGC TAC CTA CCC AAC CCT TCC CTG AAT GCC TTC AAG GAC CTG AAT GAG 7 68 

Arg Tyr Leu Pro Asn Pro Ser Leu Asn Ala Phe Lys Aso L~u Asn Glu 
245 250 " 255 

AAG TTC TAC AGC TTC ATG CAG AAG ATG GTC AAG GAG CAC TAC AAA ACC 816 
Lys Phe Tyr Ser Phe Met Gin Lys Met Val Lys Glu His Tyr Lys Th^ 
260 265 270 

TTT GAG AAG GGC CAC ATC CGG GAC ATC ACA GAC AGC CTG ATT GAG CAC 8 64 

Phe Glu Lys Gly His He Arg Asp lie Thr Asp Ser Leu He Glu His 
275 280 285 
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TGT CAG GAG AAG CAG CTG GAT GAG AAC GCC AAT GTC CAG C7G TCA GAT 912 
Cys Gin Glu Lys Gin Leu Asp Glu Asn Ala Asn Vai Gin Leu Ser Aso 
290 295 300 

5 GAG AAG ATC ATT AAC ATC GTC TTG GAC CTC TTT GGA GCT GGG TTT GAC 960 
Glu Lys lie Xle Asn He Val Leu Asp Leu Phe Gly Ala Gly Phe Aso 
305 310 315 320 

ACA GTC ACA ACT GCT ATC TCC TGG AGC CTC ATG TAT TTG GTG ATG AAC 1008 
Thr Val Thr Thr Ala lie Ser Trp Ser Leu Met Tyr Leu Val Met Asn 
'0 325 330 335 

CCC AGG GTA CAG AGA AAG ATC CAA GAG GAG CTC GAC ACA GTG ATT GGC 1056 
Pro Arg Val Gin Arg Lys lie Gin Glu Glu Leu Asp Thr Val He Gly 
340 345 350 

/5 AGG TCA CGG CGG CCC CGG CTC TCT GAC AGA TCC CAT CTG CCC TAT ATG 1104 
Arg Ser Arg Arg Pro Arg Leu Ser Asp Arg Ser His Leu Pro Tyr Met 
355 360 365 

GAG GCC TTC ATC .CTG GAG ACC TTC CGA CAC TCT TCC TTC GTC CCC TTC - 1152 

Glu Ala Phe He teu Glu Thr Phe Arg His Ser Ser Phe Val Pro Phe 
20 370 375 380 

ACC ATC CCC CAC AGC ACA ACA AGA GAC ACA AGT TTG AAA GGC TTT TAC 1200 
Thr He Pro His Ser Thr Thr Arg Asp Thr Ser Leu Lys Gly Phe Tyr 
385 390 395 400 
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ATC CCC AAG GGG CGT TGT GTC TTT GTA AAC CAG TGG CAG ATC AAC CAT 124 8 

lie Pro Lys Gly Arg Cys Val Phe Val Asn Gin Trp Gin He Asn His 
405 410 415 

GAC CAG AAG CTA TGG GTC AAC CCA TCT GAG TTC CTA CCT GAA CGG TTT ' 1296 
Asp Gin Lys Leu Trp Val Asn Pro Ser Glu ,Phe Leu Pro Glu Arg Phe 
420 425 1 430 

CTC ACC CCT GAT GGT GCT ATC GAC AAG GTG TTA AGT GAG AAG GTG ATT 1344 
Leu Thr Pro Asp Gly Ala lie Asp Lys Val Leu Ser Glu Lys Val lie 
435 440 445 

ATC TTT GGC ATG GGC AAG CGG AAG TGT ATC GGT GAG ACC GTT GCC CGC 13 92 

He Phe Gly Met Gly Lys Arg Lys Cys He Gly Glu Thr Val Ala Arg 
450 455 460 

TGG GAG GTC TTT CTC TTC CTG GCT ATC CTG CTG CAA CGG GTG GAA TTC 144 0 

Trp Glu Val Phe Leu Phe Leu Ala He Leu Leu Gin Arg Val Glu Phe 
465 470 475 450 

AGC GTG CCA CTG GGC GTG AAG GTG GAC ATG ACC CCC ATC TAT GGG CTA 14 88 

Ser Val Pro Leu Gly Val Lys Val Asp Met Thr Pro lie Tyr Gly Leu 
485 490 495 

ACC ATG AAG CAT GCC TGC TGT GAG CAC TTC CAA ATG CAG CTG CGC TCT 1536 
Thr Met Lys His Ala Cys Cys Glu His Phe Gin Met Gin Leu Arg Ser 
500 505 510 

TAG 1539 ■ 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 amino acids 

(B) TYPE: amino acid 
(D) ; TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

£xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Leu Phe Pro He Ser Met Ser Ala Thr Glu Phe Leu Leu Ala Ser 
1 5 io 15 

Val He Phe Cys Leu Val Phe Trp Val He Arg Ala Ser Arg Pro Gin 
20 25 30 

Val Pro Lys Gly Leu Lys Asn Pro Pro Gly Pro Trp Gly Trp Pro Leu 
35 40 45 

He Gly His Met Leu Thr Leu Gly Lys Asn Pro His Leu Ala Leu Ser 
50 i. 55 60 

Arg Met Ser Gin Gin Tyr Gly Asp Val Leu Gin lie Arg He Gly Ser 
65 70 75 80 

Thr Pro Val Val Val Leu Ser Gly Leu Asp Thr He Arg Gin Ala Leu 
85 90 95 

Val Arg Gin Gly Asp Asp Phe Lys Gly Arg Pro Asp Leu Tyr Thr Phe 
100 105 no 

Thr Leu lie Ser Asn Gly Gin Ser Met Ser Phe Ser Pro Aso Ser^Gly 
H5 120 125 

Pro Val Trp Ala Ala Arg Arg Arg Leu Ala Gin Asn Gly Leu Lys Ser 
130 135 140 

Phe Ser lie Ala Ser Asp Pro Ala Ser Ser Thr Ser Cys Tyr Leu Glu 
145 150 155 160 

Glu His Val Ser Lys Glu Ala Glu Val Leu He Ser Thr Leu Gin Glu 
165 170 175 

Leu Met Ala Gly Pro Gly His Phe Asn Pro Tyr Arg Tyr Val Val Val 
180 185 190 

Ser Val Thr Asn Val lie Cys Ala lie Cys Phe Gly Arg Arg Tyr Asp 
195 200 205 

His Asn His Gin Glu Leu Leu Ser Leu Val Asn Leu Asn Asn Asn Phe 
210 215 220 

Gly Glu Val Val Gly Ser Gly Asn Pro Ala Asp Phe He Pro lie Leu 
225 230 235 240 

Arg Tyr Leu Pro Asn Pro Ser Leu Asn Ala Phe Lvs Asd Leu Asn Glu 
2 45 250 " * 255 



55 
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Lys Phe Tyr Ser Phe Met Gin Lys Met Val Lys Glu Kis Tyr Lys Thr 
260 265 270 

Phe Glu Lys Gly His He Arg Asp He Thr Asp Ser Leu He Glu His 
275 ^ 280 285 

Cys Gin Glu Lys Gin Leu Aso Glu Asn Ala Asn Val Gin Leu Ser Asp 
290 295 300 

Glu Lys He He Asn He Val Leu Asp Leu Phe Gly Ala Gly Phe Asd 
305 310 315 320 

Thr Val Thr Thr Ala He Ser Trp Ser Leu Met Tyr Leu Val Met Asn 
325 330 335 

Pro Arg Val Gin Arg Lys lie Gin Glu Glu Leu Asp Thr Val He Gly 
340 345 350 

Arg Ser Arg Arg Pro Arg Leu Ser Asp Arg Ser His Leu Pro Tyr Met 
355 * 360 365 

Glu Ala Phe He Leu Glu Thr Phe Arg His Ser Ser Phe Val Pro Phe 
370 375 380 

Thr He Pro His Ser Thr Thr Arg Asp Thr Ser Leu Lys Gly Phe Tyr 
385 390 395 400 

He Pro Lys Gly Arg Cys Val Phe Val Asn Gin Trp Gin lie Asn His 
405 410 415 

V, 

Asp Gin Lys Leu Trp Val Asn Pro Ser Glu Phe Leu Pro Glu Arg Phe ' 
420 425- 430 

Leu Thr Pro Asp Gly Ala lie Asp Lys Val Leu Ser Glu Lys Val He 
435 440 445 

lie Phe Gly Met Gly Lys Arg Lys Cys He Gly Glu Thr Val Ala Arg 
450 455 460 

Trp Glu Val Phe Leu Phe Leu Ala He Leu Leu Gin Arg Val Glu Phe 
465 470 475 480 

Ser Val Pro Leu Gly Val Lys Val Asp Met Thr Pro He Tyr Gly Leu 
485 490 495 



Thr Met Lys His Ala Cys Cys Glu His Phe Gin Met Gin Leu Arg Ser 
500 505 510 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 85 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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<ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1 . .1482 



70 



75 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATG CTG GCC TCA GGG ATG CTT CTG GTG GCC TTG CTG GTC TGC CTG ACT 4 3 

Met Leu Ala Ser Gly Met Leu Leu Val Ala Leu Leu Val Cys Leu Thr 
15 10 15 

GTG ATG GTC TTG ATG TCT GTT TGG CAG CAG AGG AAG AGC AAG GGG AAG 96 
Val Met Val Leu Met Ser Val Trp Gin Gin Arg Lys Ser Lys Gly Lys 
20 25 30 

CTG CCT CCG GGA CCC ACC CCA TTG CCC TTC ATT GGA AAC TAC CTG CAG 144 
Leu Pro Pro Gly Pro Thr Pro Leu Pro Phe lie Gly Asn Tyr Leu Gin 
35 40 45 

CTG AAC ACA GAG CAG ATG TAC AAC TCC CTC ATG AAG ATC AGT GAG CGC 192 
Leu Asn Thr Glu Gin Met Tyr Asn Ser Leu Met Lys He Ser Glu Arg 
50 i. 55 60 

TAT GGC CCC GTG TTC ACC ATT CAC TTG GGG CCC CGG CGG GTC GTG GTG 240 
Tyr Gly Pro Val Phe Thr He His Leu Gly Pro Arg Arg Val Val Val 
65 70 75 80 

CTG TGT GGA CAT GAT GCC GTC AGG GAG GCT CTG GTG GAC CAG GCT GAG 288 
Leu Cys Gly His Asp Ala Val Arg Glu Ala Leu Val Asp Gin Ala Glu 
85 90 95 

GAG TTC AGC GGG CGA GGC GAG CAA GCC ACC TTC GAC TGG GTC TTC AAA * 336 
Glu Phe Ser Gly Arg Gly Glu Gin Ala Thr Phe Asp Trp Val Phe Lys 
100 105 no 

GGC TAT GGC GTG GTA TTC AGC AAC GGG GAG CGC GCC AAG CAG CTC CGG 384 
Gly Tyr Gly Val Val Phe Ser Asn Gly Glu Arg Ala Lys Gin Leu Arg 
115 120 125 

CGC TTC TCC ATC GCC ACC CTG CGG GAC TTC GGG GTG GGC AAG CGA GGC 432 
Arg Phe Ser He Ala Thr Leu Arg Asp Phe Gly Val Gly Lys Arg Gly ' 
35 130 135 140 

ATC GAG GAG CGC ATC CAG GAG GAG GCG GGC TTC CTC ATC GAC GCC CTC 4 80 

He Glu Glu Arg lie Gin Glu Glu Ala Gly Phe Leu He Asd Ala Leu 
145 150 155 * 160 

40 CGG GGC ACT GGC GGC GCC AAT ATC GAT CCC ACC TTC TTC CTG AGC CGC 528 
Arg Gly Thr Gly Gly Ala Asn lie Asp Pro Thr Phe Phe Leu Ser Arg 
165 170 175 

ACA GTC TCC AAT GTC ATC AGC TCC ATT GTC TTT GGG GAC CGC TTT GAC ' 576 

Thr Val Ser Asn Val lie Ser Ser lie Val Phe Gly Asp Arg Phe Aso 
45 180 185 190 

TAT .AAG GAC AAA GAG TTC CTG TCA CTG TTG CGC ATG ATG CTA GGA ATC 624 

Tyr Lys Asp Lys Glu Phe Leu Ser Leu Leu Arg Met Met Leu Gly lie 

195 200 205 

50 



30 



55 
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TTC CAG TTC ACG TCA ACC TCC ACG GGG CAG CTC TAT GAG ATG TTC TCT 
Phe Gin Phe Thr Ser Thr Ser Thr Gly Gin Leu Tyr Glu Met Phe Ser 
210 215 220 

TCG GTG ATG AAA CAC CTG CCA GGA CCA CAG CAA CAG GCC TTT CAG TTG 
Ser Val Met: Lys His Leu Pro Gly Pro Gin Gin Gin Ala Phe Gin Leu 
225 ^30 235 240 

CTG CAA GGG CTG GAG GAC TTC ATA GCC AAG AAG GTG GAG CAC AAC CAG 
Leu Gin Gly Leu Glu Asp Phe lie Ala Lys Lys Val Glu Kis Asn Gin 
245 * 250 255 

CGC ACG CTG GAT CCC AAT TCC CCA CGG GAC TTC ATT GAC TCC TTT CTC 
Arg Thr Leu Asp Pro Asn Ser Pro Arg Asp Phe lie Asp Ser Phe Leu 
260 265 270 

ATC CGC ATG CAG GAG GAG GAG AAG AAC CCC AAC ACG GAG TTC TAC TTG 
He Arg Met Gin Glu Glu Glu Lys Asn Pro Asn Thr Glu Phe Tyr Leu 
275 280 285 

AAA AAC CTG GTG ATG ACC ACG TTG AAC CTC TTC ATT GGG GGC ACC GAG 
Lys Asn Leu Val Met Thr Thr Leu Asn Leu Phe lie Gly Gly Thr Glu 
290 ] 295 300 

ACC GTC AGC ACC ACC CTG CGC TAT GGC TTC TTG CTG CTC ATG AAG CAC 
Thr Val Ser Thr Thr Leu Arg Tyr Gly Phe Leu Leu Leu Met Lys His 
305 310 315 320 

CCA GAG GTG GAG GCC AAG GTC CAT GAG GAG ATT GAC AGA GTG ATC GGC 
Pro Glu Val Glu Ala Lys Val His Glu Glu He Asp Arg Val He Gly 
325 330 335 

AAG AAC CGG CAG CCC AAG TTT GAG GAC CGG GCC AAG ATG CCC TAC ATG 
Lys Asn Arg Gin Pro Lys Phe Glu Asp Arg Ala Lys Met Pro Tyr Met 
340 345 • 350 

GAG GCA GTG ATC CAC GAG ATC CAA AGA TTT GGA GAC GTG ATC CCC ATG 
Glu Ala Val He His Glu He Gin Arg Phe Gly Asp Val lie Pro Met 
355 360 365 

AGT TTG GCC CGC AGA GTC AAA AAG GAC ACC AAG TTT CGG GAT TTC TTC 
Ser Leu Ala Arg Arg Val Lys Lys Asp Thr Lys Phe Arg Asp Phe Phe 
370 375 380 

CTC CCT AAG GGC ACC GAA GTG TAC CCT ATG CTG GGC TCT GTG CTG AGA 
Leu Pro Lys Gly Thr Glu Val Tyr Pro Met Leu Gly Ser Val Leu Arg 
385 390 395 400 

GAC CCC AGT TTC TTC TCC AAC CCC CAG GAC TTC AAT CCC CAG CAC TTC 
Asp Pro Ser Phe Phe Ser Asn Pro Gin Asp Phe Asn Pro Gin His Phe 
405 ■ 410 415 

CTG AAT GAG AAG GGG CAG TTT AAG AAG AGT GAT GCT TTT GTG CCC TTT 
Leu Asn Glu Lys Gly Gin Phe Lys Lys Ser Asp Ala Phe Val Pro Phe 
420 425 430 

TCC ATC GGA AAG CGG AAC TGT TTC GGA GAA GGC CTG GCC AGA ATG GAG 
Ser He Gly Lys Arg Asn Cys Phe Gly Glu Gly Leu Ala Arg Met Glu 
435 440 445 
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CTC TTT CTC TTC TTC ACC ACC GTC ATG CAG AAC TTC CGC C7C AAG TCC 1392 
Leu Phe Leu Phe Phe Thr Thr Val Met Gin Asn Phe Arg Leu Lys Ser 
450 455 460 

5 TCC CAG TCA CCT AAG GAC ATT GAC GTG TCC CCC AGA CAC GTG GGC TTT 144 0 

Ser Gin Ser Pro Lys Asp He Asp Val Ser Pro Arg His Val Gly Phe 
465 * 470 475 480 

GCC ACG ATC CCA CGA AAC TAC ACC ATG AGC TTC CTG CCC CGC 1482 
Ala Thr He Pro Arg Asn Tyr Thr Met Ser Phe Leu Pro Arg 
10 485 490 

* TGA 1485 



75 



20 



25 



35 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 amino acids 

(B) TYPE: amino acid 
(D) .TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Leu Ala Ser Gly Met Leu Leu Val Ala Leu Leu Val Cys Leu Thr 
15 10 15 

Val Met Val Leu Met Ser Val Trp Gin Gin Arg Lys Ser Lys Gly Lvs 
20 25 30 

Leu Pro Pro Gly Pro Thr Pro Leu Pro Phe He Gly Asn Tyr Leu Gin 
35 40 , 45 

Leu Asn Thr Glu Gin Met Tyr Asn Ser Leu Met Lys He Ser Glu Arg 
50 55 60 

Tyr Gly Pro Val Phe Thr He His Leu Gly Pro Arg Arg Val Val Val 
65 70 75 80 

Leu Cys Gly His Asp Ala Val Arg Glu Ala Leu Val Asp Gin Ala Glu 
85 90 95 

Glu Phe Ser Gly Arg Gly Glu Gin Ala Thr Phe Asp Trp Val Phe Lys 
100 105 110 

Gly Tyr Gly Val Val Phe Ser Asn Gly Glu Arg Ala Lys Gin Leu Arg 
115 120 125 

Arg Phe Ser He Ala Thr Leu Arg Asp Phe Gly Val Gly Lys Arg Gly 
130 135 * 140 

lie Glu Glu Arg He Gin Glu Glu Ala Gly Phe Leu He Asp Ala Leu 
145 150 155 160 

Arg Gly Thr Gly Gly Ala Asn He Asp Pro Thr Phe Phe Leu Ser Arg 
165 170 175 
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Thr Val Ser Asn Val lie Ser Ser lie Val Phe Gly Asp Arg Phe Asp 

180 185 " 190 

Tyr Lys Asp Lys Glu Phe Leu Ser Leu Leu Arg Met Met Leu Gly lie 

5 195 t 200 205 

Phe Gin Phe Thr Ser Thr Ser Thr Gly Gin Leu Tyr Glu Met Phe Ser 

210 215 220 

10 Ser Val Met Lys His Leu Pro Gly Pro Gin Gin Gin Ala Phe Gin Leu 

225 230 235 240 

Leu Gin Gly Leu Glu Asp Phe He Ala Lys Lys Val Glu His Asn Gin 
245 250 255 



75 



Arg Thr Leu Asp Pro Asn Ser Pro Arg Asp Phe He Asp Ser Phe Leu 
260 265 270 



20 



He Arg Met Gin Glu Glu Glu Lys Asn Pro Asn Thr Glu Phe Tyr Leu 
275 i. 280 285 

Lys Asn Leu Val Met Thr Thr Leu Asn Leu Phe He Gly Gly Thr Glu 
290 295 300 



25 



30 



Thr Val Ser Thr Thr Leu Arg Tyr Gly Phe Leu Leu Leu Met Lys His 
305 310 315 320 

Pro Glu Val Glu Ala Lys Val His Glu Glu He Aso Arg Val He Gly 
325 330 * 335..-' 

Lys Asn Arg Gin Pro Lys Phe Glu Asp Arg Ala Lys Met Pro Tyr Met 
340 345 ; 350 

Glu Ala Val He His Glu He Gin Arg Phe Gly Asd Val He Pro Met 
355 360 " 365 



35 



Ser Leu Ala Arg Arg Val Lys Lys Asp Thr Lys Phe Arg Asp Phe Phe 
370 375 380 



Leu Pro Lys Gly Thr Glu Val Tyr Pro Met Leu Gly Ser Val Leu Arg 
385 390 395 400 



40 



Asp Pro Ser Phe Phe Ser Asn Pro Gin Asp Phe Asn Pro Gin His Phe 
405 410 415 



Leu Asn Glu Lys Gly Gin Phe Lys Lys Ser Asp Ala Phe Val Pro Phe 

420 425 430 

45 Ser He Gly Lys Arg Asn Cys Phe Gly Glu Gly Leu Ala Arg Met Glu 
435 440 445 

Leu Phe Leu Phe Phe Thr Thr Val Met Gin Asn Phe Arg Leu Lys Ser 
450 455 460 



so Ser Gin Ser Pro Lys Asp He Asp Val Ser Pro Arg His Val Gly Phe 

465 470 475 480 



»• 
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Ala Thr He Pro Arg Asn Tyr Thr Met Ser Phe Leu Pro Arg 
485 490 

(2) INFORMATION FOR SSQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1485 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .1482 

75 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG CTG GCC TCA GGG ATG CTT CTG GTG GCC TTG CTG GTC TGC CTG ACT 4 8 

Met Leu Ala Ser Gly Met Leu Leu Val Ala Leu Leu Val Cys Leu Thr 
1.5 10 15 

i 

20 GTG ATG GTC TTG ATG TCT GTT TGG CAG CAG AGG AAG AGC AAG GGG AAG 96 

Val Met Val Leu Met Ser Val Trp Gin Gin Arg Lys Ser Lys Gly Lys 
20 25 30 

CTG CCT CCG GGA CCC ACC CCA TTG CCC TTC ATT GGA AAC TAC CTG CAG 144 
Leu Pro Pro Gly Pro Thr Pro Leu Pro Phe He Gly Asn Tyr Leu Gin 
25 35 40 45 

CTG AAC ACA GAG CAG ATG TAC AAC TCC CTC ATG AAG ATC AGT GAG CGC . 192 
Leu Asn Thr Glu Gin Met Tyr Asn Ser Leu Met Lys He Ser Glu A^g 
50 55 60 

3Q TAT GGC CCC GTG TTC ACC ATT CAC TTG GGG CCC CGG CGG GTC GTG GTG 240 
Tyr Gly Pro Val Phe Thr He His Leu Gly Pro Arg Arg Val Val Val 
65 70 75 80 

CTG TGT GGA CAT GAT GCC GTC AGG GAG GCT CTG GTG GAC CAG GCT GAG 288 
Leu Cys Gly His Asp Ala Val Arg Glu Ala Leu Val Asp Gin Ala Glu 
35 8 5 90 95 

GAG TTC AGC GGG CGA GGC GAG CAA GCC ACC TTC GAC TGG GTC TTC AAA 3 36 

Glu Phe Ser Gly Arg Gly Glu Gin Ala Thr Phe Asp Trp Val Phe Lys 
100 105 no 



40 



45 



50 



GGC TAT GGC GTG GTA TTC AGC AAC GGG GAG CGC GCC AAG CAG CTC CGG 3 84 

Gly Tyr Gly Val Val Phe Ser Asn Gly Glu Arg Ala Lys Gin Leu Arg 
115 120 125 

CGC TTC TCC ATC GCC ACC CTG CGG GAC TTC GGG GTG GGC AAG CGA GGC - 4 32 

Arg Phe Ser He Ala Thr Leu Arg Asp Phe Gly Val Gly Lys Arc Gly 

130 135 140 

ATC GAG GAG CGC ATC CAG GAG GAG GCG GGC TTC CTC ATC GAC GCC CTC 4 80 

He Glu Glu Arg lie Gin Glu Glu Ala Gly Phe Leu He Asp Ala Leu 
145 150 155 160 



»■ 
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15 



20 



25 



30 



35 



40 



45 



50 



CGG 

Arg 



ACA 
Thr 



TAT 
Tyr 



TTC 
Phe 



TCG 
Ser 
225 

CTG 
Leu 



CGC 
Arg 



ATC 

He 



AAA 
Lys 



ACC 
Thr 
305 

CCA 
Pro 



AAG 
Lys 



GAG 
Glu 



AGT 
Ser 



CTC 
Leu 
385 



GGC ACT GGC GGC GCC AAT ATC GAT CCC ACC TTC 
Gly Thr Gly Gly Ala Asn He Asp Pro Thr Phe 
165 170 

GTC TCC AAT GTC ATC AGC TCC ATT GTC TTT GGG 
Val Ser Asn Val He Ser Ser He Val Phe Gly 
180 % 185 

AAG GAC AAA GAG TTC CTG TCA CTG TTG CGC ATG 
Lys Asp Lys Glu Phe Leu Ser Leu Leu Arg Met 
195 200 

CAG TTC ACG TCA ACC TCC ACG GGG CAG CTC TAT 
Gin Phe Thr Ser Thr Ser Thr Gly Gin Leu Tyr 
210 215 220 

GTG ATG AAA CAC CTG CCA GGA CCA CAG CAA CAG 
Val Met Lys His Leu Pro Gly Pro Gin Gin Gin 
230 235 

CAA GGG CTG GAG GAC TTC ATA GCC AAG AAG GTG 
Gin Gly Leu ^Glu Asp Phe He Ala Lys Lys Val 
^45 250 

ACG CTG GAT CCC AAT TCC CCA CGG GAC TTC ATT 
Thr Leu Asp Pro Asn Ser Pro Arg Asp Phe He 
260 265 

CGC ATG CAG GAG GAG GAG AAG AAC CCC AAC ACG 
Arg Met Gin Glu Glu Glu Lys Asn Pro Asn Thr 
275 280 

AAC CTG GTG ATG ACC ACG TTG AAC CTC TTC ATT 
Asn Leu Val Met Thr Thr Leu Asn Leu Phe He 
290 295 300 

GTC AGC ACC ACC CTG CGC TAT GGC TTC TTG CTG 
Val Ser Thr Thr Leu Arg Tyr Gly Phe Leu Leu 
310 315 

GAG GTG GAG GCC AAG GTC CAT GAG GAG ATT GAC 
Glu Val Glu Ala Lys Val His Glu Glu He Asp 
325 330 

AAC CGG CAG CCC AAG TTT GAG GAC CGG GCC AAG 
Asn Arg Gin Pro Lys Phe Glu Asp Arg Ala Lys 
340 345 

GCA GTG ATC CAC GAG ATC CAA AG A TTT GGA GAC 
Ala Val He His Glu He Gin Arg Phe Gly Asp 
355 360 

TTG GCC CGC AG A GTC AAA AAG GAC ACC AAG TTT 
Leu Ala Arc Arg Val Lys Lys Asp Thr Lys Phe 
370 375 " 380 

CCT AAG GGC ACC GAA GTG TAC CCT ATG CTG GGC 
Pro Lys Gly Thr Glu Val Tyr Pro Met Leu Gly 
390 395 



TTC CTG AGC CGC 
Phe Leu Ser Arg 
175 

GAC CGC TTT GAC 
Asp Arg Phe Asd 
190 

ATG CTA GGA ATC 
Met Leu Gly He 
205 

GAG ATG TTC TCT 
Glu Met Phe Ser 



GCC TTT 
Ala Phe 



GAG CAC 
Glu His 



GAC TCC 
Asp Ser 
270 

GAG TTC 
Glu Phe 
285 

GGG GGC 
Gly Gly 



CAG TTG 
Gin Leu 
240 

AAC CAG 
Asn Gin 
255 

TTT CTC 
Phe Leu 



TAC TTG 
Tyr Leu 



ACC GAG 
Thr Glu 



CTC ATG 
Leu Met 



AGA GTG 
Arg Val 



ATG CCC 
Met Pro 
350 

GTG ATC 
Val He 
365 

CGG GAT 
Arg Asp 



AAG CAC 
Lys His 
320 

ATC GGC 
He Gly 
335 

TAC ATG 
Tyr Met 



CCC ATG 
Pro Me~ 



TTC TTC 
Phe Phe 



TCT GTG CTG AGA 
Ser Val Leu Arg 
400 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 
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15 



20 



25 



GAC CCC AGT TTC TTC TCC AAC CCC CAG GAC TTC AAT CCC CAG CAC TTC 1248 
Asp Pro Ser Phe Phe Ser Asn Pro Gin Asp Phe Asn Pro Gin His Phe 
405 410 415 

CTG AAT GAG AAG GGG CAG TTT AAG AAG AGT GAT GCT TTT GTG CCC TTT 12 96 

Leu Asn Glu Lys Gly Gin Phe Lys Lys Ser Asp Ala Phe Val Pro Phe 
420 425 430 

TCC ATC GGA AAG CGG AAC TGT TTC GGA GAA GGC CTG GCC AG A ATG GAG 1344 
Ser lie Gly Lys Arg Asn Cys Phe Gly Glu Gly Leu Ala Arg Met Glu 
435 440 445 

■CTC TTT CTC TTC TTC ACC ACC GTC ATG CAG AAC TTC CGC CTC AAG TCC 13 92 

Leu Phe Leu Phe Phe Thr Thr Val Met Gin Asn Phe Arg Leu Lys Ser 
450 455 460 

TCC CAG TCA CCT AAG GAC ATT GAC GTG TCC CCC AAA CAC GTG GGC TTT 144 0 

Ser Gin Ser Pro Lys Asp lie Asp Val Ser Pro Lys His Val Gly Phe 
465 470 475 480 

GCC ACG ATC CCA CGA AAC TAC ACC ATG AGC TTC CTG CCC CGC 14 82 

Ala Thr lie Pro Arg Asn Tyr Thr Met Ser Phe Leu Pro Arg 
485 490 

TGA 1485 

(2) INFORMATION FOR SEQ ID NO: 18: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Leu Ala Ser Gly Met Leu Leu Val Ala Leu Leu Val Cys Leu Thr 
1 5 10 15 

Val Met Val Leu Met Ser Val Trp Gin Gin Arg Lys Ser Lys Gly Lys 
20 25 30 

Leu Pro Pro Gly Pro Thr Pro Leu Pro Phe He Gly Asn Tyr Leu Gin 
35 40 45 

Leu Asn Thr Glu Gin Met Tyr Asn Ser Leu Met Lys He Ser Glu Arc 
50 55 60 

Tyr Gly Pro Val Phe Thr He His Leu Gly Pro Arg Arg Val Val Val 
65 70 75 " 80 

45 Leu Cys Gly His Asp Ala Val Arg Glu Ala Leu Val Asp Gin Ala Glu 

85 90 95 

Glu Phe Ser Gly Arg Gly Glu Gin Ala Thr Phe Asp TrD Val Phe Lvs 
100 105 " no 

50 



30 



40 



55 



55 
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Gly Tyr Gly Val Val Phe Ser Asn Gly Glu Arg Ala Lys Gin Leu Arg 
115 120 125 

Arg Phe Ser He Ala Thr Leu Arg Asp Phe Gly Val Gly Lys Arg Gly 
130 \ 135 140 

lie Glu Glu Arg He Gin Glu Glu Ala Gly Phe Leu He Asp Ala Leu 

145 150 155 160 



70 



Arg Gly Thr Gly Gly Ala Asn lie Asp Pro Thr Phe Phe Leu Ser Arg 
165 170 175 



75 



Thr Val Ser Asn Val He Ser Ser He Val Phe Gly Asp Arg Phe Asp 
180 185 190 

Tyr Lys Asp Lys Glu Phe Leu Ser Leu Leu Arg Met Met Leu Gly He 
195 200 205 



20 



25 



Phe Gin Phe Thr Ser Thr Ser Thr Gly Gin Leu Tyr Glu Met Phe Ser 

210 \ 215 220 

Ser Val Met Lys His Leu Pro Gly Pro Gin Gin Gin Ala Phe Gin Leu 

225 230 235 240 

Leu Gin Gly Leu Glu Asp Phe He Ala Lys Lys Val Glu His Asn Gin 

245 250 255 

Arg Thr Leu Asp Pro Asn Ser Pro Arg Asp Phe He Asp Ser Phe Leu 

260 265 270 ■/' 



30 



He Arg Met Gin Glu Glu Glu Lys Asn Pro Asn Thr Glu Phe Tyr Leu' 
275 280 ' 285 



Lys Asn Leu Val Met Thr Thr Leu Asn Leu Phe He Gly Gly Thr Glu 

290 295 300 

Thr Val Ser Thr Thr Leu Arg Tyr Gly Phe Leu Leu Leu Met Lys His 

35 305 310 315 320 

Pro Glu Val Glu Ala Lys Val His Glu Glu He Asp Arg Val He Gly 

325 330 335 

4Q Lys Asn Arg Gin Pro Lys Phe Glu Asp Arg Ala Lys Met Pro Tyr Met 

340 345 • 350 

Glu Ala Val He His Glu He Gin Arg Phe Gly Asp Val He Pro Met 

355 360 365 

45 Ser Leu Ala Arg Arg Val Lys Lys Asp Thr Lys Phe Arcr Asp Phe Phe 

370 375 * 380 



50 



Leu Pro Lys Gly Thr Glu Val Tyr Pro Met Leu Gly Ser Val Leu Arg 

385 390 395 400 

Asp Pro Ser Phe Phe Ser Asn Pro Gin Asp Phe Asn Pro Gin His Phe 

405 410 415 



55 
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Leu Asn Glu Lys Gly Gin ?he Lys Lys Ser Asp Ala Phe Val Pro Phe 
420 425 430 

Ser He Gly Lys Arc Asn Cys Phe Gly Glu Gly Leu Ala Arg Met Glu 
435 440 445 

Leu Phe Leu Phe Phe > Thr Thr Val Met Gin Asn Phe Arg Leu Lys Ser 
450 455 460 

Ser Gin Ser Pro Lys Asp He Asp Val Ser Pro Lys His Val Gly Phe 
465 470 475 480 

•Ala Thr He Pro Arg Asn Tyr Thr Met Ser Phe Leu Pro Arg 
485 490 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 14 73 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG GAA CTC AGC GTC CTC CTC TTC CTT GCA CTC CTC ACA GGA CTC TTG 4 8 

Met Glu Leu Ser Val Leu Leu Phe Leu Ala Leu Leu Thr Gly Leu Leu '** 
15 10 15 



75 



20 



30 



35 



CTA CTC CTG GTT CAG CGC CAC CCT AAC ACC CAT GAC CGC CTC CCA CCA 96 
Leu Leu Leu Val Gin Arg His Pro Asn Thr His Asp Arg Leu Pro Pro 
20 25 30 

GGG CCC CGC CCT CTG CCC CTT TTG GGA AAC CTT CTG CAG ATG GAT AGA 14 4 

Gly Pro Arg Pro Leu Pro Leu Leu Gly Asn Leu Leu Gin Met Asp Arg ' 
35 40 45 

AGA GGC CTA CTC AAA TCC TTT CTG AGG TTC CGA GAG AAA TAT GGG GAC 192 
Arg Gly Leu Leu Lys Ser Phe Leu Arg Phe Arg Glu Lys Tyr Gly Asp 
50 55 60 

GTC TTC ACG GTA CAC CTG GGA CCG AGG CCC GTG GTC ATG CTG TGT GGA 24 0 

Val Phe Thr Val His Leu Gly Pro Arg Pro Val Val Met Leu Cys Gly 
65 70 75 80 

GTA GAG GCC ATA CGG GAG GCC CTT GTG GAC AAG GCT GAG GCC TTC TCT 28 8 

Val Glu Ala He Arg Glu Ala Leu Val Asp Lys Ala Glu Ala Phe Ser 
85 90 95 

GGC CGG GGA AAA ATC GCC ATG GTC GAC CCA TTC TTC CGG GGA TAT GGT 336 
Gly Arg Gly Lys He Ala Met Val Asp Pro Phe Phe Arg Gly Tyr Gly 
100 105 " 110 
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GTG ATC TTT GCC AAT GGA AAC CGC TGG AAG GTG CTT CGG CGA TTC TCT 334 
Val lie Phe Ala Asn Gly Asn Arg Trp Lys Val Leu Arg Arg Phe Ser 
115 120 125 

5 GTG ACC ACT ATG AGG GAC TTC GGG ATG GGA AAG CGG AGT GTG GAG GAG 4 32 

Val Thr Thr Met Arg Asp Phe Gly Met Gly Lys Arg Ser Val Glu Glu 
130 135 140 

CGG ATT CAG GAG GAG GCT CAG TGT CTG ATA GAG GAG CTT CGG AAA TCC 4 80 

Arg lie Gin Glu Glu Ala Gin Cys Leu He Glu Glu Leu Arg Lys Ser 
'0 145 150 155 160 

AAG GGG GCC CTC ATG GAC CCC ACC TTC CTC TTC CAG TCC ATT ACC GCC 528 
Lys Gly Ala Leu Met Asp Pro Thr Phe Leu Phe Gin Ser He Thr Ala 
165 170 175 

15 AAC ATC ATC TGC TCC ATC GTC TTT GGA AAA CGA TTC CAC TAC CAA GAT 576 
Asn He He Cys Ser He Val Phe Gly Lys Arg Phe His Tyr Gin Asp 
180 185 190 

CAA GAG TTC CTG AAG ATG CTG AAC TTG TTC TAC CAG ACT TTT TCA CTC ■ 6 24 

Gin Glu Phe Leu Iiys Met Leu Asn Leu Phe Tyr Gin Thr Phe Ser Leu 
20 1 9 5 2 0 0 2 0 5 

ATC AGC TCT GTA TTC GGC CAG CTG TTT GAG CTC TTC TCT GGC TTC TTG 672 
He Ser Ser Val Phe Gly Gin Leu Phe Glu Leu Phe Ser Gly Phe Leu 
210 215 220 



25 



30 



35 



AAA TAC TTT CCT GGG GCA CAC AGG CAA GTT TAC AAA AAC CTG CAG GAA 720 
Lys Tyr Phe Pro Gly Ala His Arg Gin Val Tyr Lys Asn Leu Gin Glu 
225 230 235 240 

ATC AAT GCT TAC ATT GGC CAC AGT GTG GAG AAG CAC CGT GAA ACC CTG 768 
He Asn Ala Tyr He Gly His Ser Val Glu Lys His Arg Glu Thr Leu 
245 250 ' 255 

GAC CCC AGC GCC CCC AAG GAC CTC ATC GAC ACC TAC CTG CTC CAC ATG 816 
Asp Pro Ser Ala Pro Lys Asp Leu He Asp Thr Tyr Leu Leu His Met 
260 265 270 

GAA AAA GAG AAA TCC AAC GCA CAC AGT GAA TTC AGC CAC CAG AAC CTC 8 64 

Glu Lys Glu Lys Ser Asn Ala His Ser Glu Phe Ser His Gin Asn Leu 
275 280 285 

AAC CTC AAC ACG CTC TCG CTC TTC TTT GCT GGC ACT GAG ACC ACC AGC 912 
Asn Leu Asn Thr Leu Ser Leu Phe Phe Ala Gly Thr Glu Thr Thr Ser 
290 295 300 

ACC ACT CTC CGC TAC GGC TTC CTG CTC ATG CTC AAA TAC CCT CAT GTT 9 60 

Thr Thr Leu Arg Tyr Gly Phe Leu Leu Met Leu Lys Tyr Pro His Val 
305 310 315 320 

GCA GAG AGA GTC TAC AGG GAG ATT GAA CAG GTG ATT GGC CCA CAT CGC 100 8 

Ala Glu Arg Val Tyr Arg Glu lie Glu Gin Val He Gly Pro His Arg 
325 330 335 

CCT CCA GAG CTT CAT GAC CGA GCC AAA ATG CCA TAC ACA GAG GCA GTC 10 56 

Pro Pro Glu Leu His Asp Arg Ala Lys Met Pro Tyr Thr Glu Ala Val 
340 " 345 350 
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ATC TAT GAG ATT CAG AGA TTT TCC GAC CTT CTC CCC ATG GGT GTG CCC 1104 
lie Tyr Glu lie Gin Arg Phe Ser Asp Leu Leu Pro Met Gly Val Pro 
355 360 365 

CAC ATT GTC ACC CAA CAC ACC AGC TTC CGA GGG TAG ATC ATC CCC AAG 1152 
His He Val Thr Gin His Thr Ser Phe Arg Gly Tyr He He Pro Lys 
370 * 375 380 

GAC ACA GAA GTA TTT CTC ATC CTG AGC ACT GCT CTC CAT GAC CCA CAC 1200 
Asp Thr Glu Val Phe Leu He Leu Ser Thr Ala Leu His Asd Pro His 
335 390 395 * 400 

TAC TTT GAA AAA CCA GAC GCC TTC AAT CCT GAC CAC TTT CTG GAT GCC 124 8 

Tyr Phe Glu Lys Pro Asp Ala Phe Asn Pro Asp His Phe Leu Asp Ala 
405 410 415 

AAT GGG GCA CTG AAA AAG ACT GAA GCT TTT ATC CCC TTC TCC TTA GGG 1296 
Asn Gly Ala Leu Lys Lys Thr Glu Ala Phe lie Pro Phe Ser Leu Gly 
420 425 430 

AAG CGG ATT TGT CTT GGT GAA GGC ATC GCC CGT GCG GAA TTG TTC CTC 1344 
Lys Arg He Cy.s Leu Gly Glu Gly He Ala Arg Ala Glu Leu Phe Leu 
435 * 440 445 

TTC TTC ACC ACC ATC CTC CAG AAC TTC TCC ATG GCC AGC CCC GTG GCC 13 92 

Phe Phe Thr Thr lie Leu Gin Asn Phe Ser Met Ala Ser Pro Val Ala 
450 455 460 

CCA GAA GAC ATC GAT CTG ACA CCC CAG GAG TGT GGT GTG GGC AAA ATA 144 0 

Pro Glu Asp He Asp Leu Thr Pro Gin Glu Cys Gly Val Gly Lys He 
465 470 475 480 

CCC CCA ACA TAC CAG ATC CGC TTC CTG CCC CGC TGA 14 76 

Pro Pro Thr Tyr Gin He Arg Phe Leu Pro Arg 
485 490 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 91 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

40 

Met Glu Leu Ser Val Leu Leu Phe Leu Ala Leu Leu Thr Gly Leu Leu 
15 10 15 

Leu Leu Leu Val Gin Arg His Pro Asn Thr His Asp Arg Leu Pro Pro 
20 25 30 



25 



30 



45 



Gly Pro Arc Pro Leu Pro Leu Leu Gly Asn Leu Leu Gin Met Asp Arg 
35 40 45 
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Arg Gly Leu Leu Lys Ser Phe Leu Arg Phe Arg Glu Lys Tyr Gly Asp 
50 55 60 

Val Phe Thr Val His Leu Gly Pro Arg Pro Val Val Met Leu Cys Gly 
5 65 70 75 80 

Val Glu Ala lie Arg Glu Ala Leu Val Asp Lys Ala Glu Ala Phe Ser 
85 90 95 

Gly Arg Gly Lys He Ala Met Val Asp Pro Phe Phe Arg Gly Tyr Gly 
W 100 105 " 110 

Val He Phe Ala Asn Gly Asn Arg Trp Lys Val Leu Arc Arg Phe Ser 
115 120 125 

Val Thr Thr Met Arg Asp Phe Gly Met Gly Lys Arg Ser Val Glu Glu 
'5 130 135 140 

Arg He Gin Glu Glu Ala Gin Cys Leu He Glu Glu Leu Arg Lys Ser 
145 150 155 160 

Lys Gly Ala Xeu Met Asp Pro Thr Phe Leu Phe Gin Ser He Thr Ala 
20 165 170 175 

Asn lie lie Cys Ser lie Val Phe Gly Lys Arg Phe Kis Tyr Gin Asp 
180 185 190 

Gin Glu Phe Leu Lys Met Leu Asn Leu Phe Tyr Gin Thr Phe Ser Leu 
25 195 200 205 

lie Ser Ser Val Phe Gly Gin Leu Phe Glu Leu Phe Ser Gly Phe Leu 
210 215 220 

Lys Tyr Phe Pro Gly Ala His Arg Gin Val Tyr Lys Asn Leu Gin Glu 
50 225 230 235 240 

lie Asn Ala Tyr He Gly His Ser Val Glu Lys His Arg Glu Thr Leu 
245 250 255 



35 



40 



45 



50 



Asp Pro Ser Ala Pro Lys Asp Leu He Asp Thr Tyr Leu Leu His Met 
260 265 270 

Glu Lys Glu Lys Ser Asn Ala His Ser Glu Phe Ser His Gin Asn Leu 
275 280 285 

Asn Leu Asn Thr Leu Ser Leu Phe Phe Ala Gly Thr Glu Thr Thr Ser 
290 295 300 

Thr Thr Leu Arg Tyr Gly Phe Leu Leu Met Leu Lys Tvr Pro His Val 

305 310 315 * 320 ' 

Ala Glu Arg Val Tyr Arg Glu lie Glu Gin Val He Gly Pro His Arg 
325 330 335 

Pro Pro Glu Leu His Asp Arg Ala Lys Met Pro Tyr Thr Glu Ala Val 
340 " 345 350 

lie Tyr Glu He Gin Arg Phe Ser Asd Leu Leu Pro Met Glv Val Pro 
355 360 " 365 
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His He Val Thr Gin His Thr Ser Phe Arg Gly Tyr He He Pro Lys 
370 375 380 

Asp Thr Glu Val Phe Leu He Leu Ser Thr Ala Leu His Aso Pro His 
355 390 395 " 400 

Tyr Phe Glu Lys Pro Asp Ala Phe Asn Pro Asp His Phe Leu Asp Ala 
405 410 415 

Asn Gly Ala Leu Lys Lys Thr Glu Ala Phe He Pro Phe Ser Leu Gly 
420 425 430 

Lys Arg He Cys Leu Gly Glu Gly He Ala Arg Ala Glu Leu Phe Leu 
435 440 445 

Phe Phe Thr Thr He Leu Gin Asn Phe Ser Met Ala Ser Pro Val Ala 
450 455 460 

Pro Glu Asp He Asp Leu Thr Pro Gin Glu Cys Gly Val Gly Lys He 
465 470 475 480 

Pro Pro Thr Tyr Gin He Arg Phe Leu Pro Arg 
465 490 

(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

0 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
30 (B) LOCATION: 1..1470 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG GAA CCT TTT GTG GTC CTG GTG CTG TGT CTC TCT TTT ATG CTT CTC 4 8 

35 Met Glu Pro Phe Val Val Leu Val Leu Cys Leu Ser Phe Met Leu Leu 
15 10 15 

TTT TCA CTC TGG AGA CAG AGC TGT AGG AGA AGG AAG CTC CCT CCT GGC 96 
Phe Ser Leu Trp Arg Gin Ser Cys Arg Arg Arg Lys Leu Pro Pro Gly 
20 25 30 



CCC ACT CCT CTT CCT ATT ATT GGA AAT ATG CTA CAG ATA GAT GTT AAG 14 4 

Pro Thr Pro Leu Pro He lie Gly Asn Met Leu Gin He Asp Val Lys 
35 40 45 

GAC ATC TGC AAA TCT TTC ACC AAT TTC TCA AAA GTC TAT GGT CCT GTG 192 
Asp He Cys Lys Ser Phe Thr Asn Phe Ser Lys Val Tyr Gly Pro Val 
50 * 55 60 
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TTC ACC GTG TAT TTT GGC ATG AAT CCC ATA GTG GTG TTT CAT GGA TAT 240 
Phe Thr Val Tyr Phe Gly Met Asn Pro He Val Val Phe His Gly Tyr 
65 70 75 80 

5 GAG GCA GTG AAG GAA GCC CTG ATT GAT AAT GGA GAG GAG TTT TCT GGA 288 
Glu Ala Val Lys Glu Ala Leu He Asp Asn Gly Glu Glu Phe Ser Gly 
85 90 95 

AGA GGC AAT TCC CCA ATA TCT CAA AGA ATT ACT AAA GGA CTT GGA ATC 336 
Arg Gly Asn Ser Pro He Ser Gin Arg He Thr Lys Gly Leu Gly He 
™ 100 105 110 

ATT TCC AGC AAT GGA AAG AGA TGG AAG GAG ATC CGG CGT TTC TCC CTC 384 
He Ser Ser Asn Gly Lys Arg Trp Lys Glu He Arg Arg Phe Ser Leu 
115 120 125 

75 ACA ACC TTG CGG AAT TTT GGG ATG GGG AAG AGG AGC ATT GAG GAC CGT 432 
Thr Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser He Glu Asp Arg 
130 135 140 

GTT CAA GAG. GAA GCT CAC TGC CTT GTG GAG GAG TTG AGA AAA ACC AAG 480 
Val Gin Glu Glu Ala His Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 
20 145 150 155 160 

GCT TCA CCC TGT GAT CCC ACT TTC ATC CTG GGC TGT GCT CCC TGC AAT 528 
Ala Ser Pro Cys Asp Pro Thr Phe He Leu Gly Cys Ala Pro Cys Asn 
165 170 175 
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30 



35 



40 



45 



50 



GTG ATC TGC TCC GTT GTT TTC CAG AAA CGA TTT GAT TAT AAA GAT CAG 576 
Val He Cys Ser Val Val Phe Gin Lys Arg Phe Asp Tyr Lys Asp Gin 
180 185 190 

AAT TTT CTC ACC CTG ATG AAA AGA TTC AAT GAA AAC TTC AGG ATT CTG 624 
Asn Phe Leu Thr Leu Met Lys Arg Phe Asn Glu Asn Phe Arg He Leu 
195 200 205 

AAC TCC CCA TGG ATC CAG GTC TGC AAT AAT TTC CCT CTA CTC ATT GAT 672 
Asn Ser Pro Trp He Gin Val Cys Asn Asn Phe Pro Leu Leu He Asp 
210 215 220 

TGT TTC CCA GGA ACT CAC AAC AAA GTG CTT AAA AAT GTT GCT CTT ACA ' 720 
Cys Phe Pro Gly Thr His Asn Lys Val Leu Lys Asn Val Ala Leu Thr 
225 230 235 240 

CGA AGT TAC ATT AGG GAG AAA GTA AAA GAA CAC CAA GCA TCA CTG GAT 768 
Arg Ser Tyr He Arg Glu Lys Val Lys Glu His Gin Ala Ser Leu Asp 
245 250 255 

GTT AAC AAT CCT CGG GAC TTT ATC GAT TGC TTC CTG ATC AAA ATG GAG 816 
Val Asn Asn Pro Arg Asp Phe He Asp Cys Phe Leu He Lys Met Glu 
260 265 270 

CAG GAA AAG GAC AAC CAA AAG TCA GAA TTC AAT ATT GAA AAC TTG GTT 864 
Gin Glu Lys Asp Asn Gin Lys Ser Glu Phe Asn He Glu Asn Leu Val 
275 280 285 

GGC ACT GTA GCT GAT CTA TTT GTT GCT GGA ACA GAG ACA ACA AGC ACC 912 
Gly Thr Val Ala Asp Leu Phe Val Ala Gly Thr Glu Thr Thr Ser Thr 
290 295 300 
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ACT CTG AGA TAT GGA CTC CTG CTC CTG CTG AAG CAC CCA GAG GTC ACA 960 
Thr Leu Arg Tyr Gly Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 
3C5 310 315 320 

GCT AAA GTC CAG GAA GAG ATT GAT CAT GTA ATT GGC AGA CAC AGG AGC 1008 
Ala Lys Val Gin Gly Glu lie Asp His Val lie Gly Arg His Arg Ser 
32S 330 335 

CCC TGC ATG CAG GAT AGG AGC CAC ATG CCT TAC ACT GAT GCT GTA GTG 1056 
Pro Cys Met Gin Asp Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 
340 345 350 

• CAC GAG ATC CAG AGA TAC AGT GAC CTT GTC CCC ACC GGT GTG CCC CAT 1104 
His Glu lie Gin Arg Tyr Ser Asp Leu Val Pro Thr Gly Val Pro His 
355 360 365 

GCA GTG ACC ACT GAT ACT AAG TTC AGA AAC TAC CTC ATC CCC AAG GGC 1152 
Ala Val Thr Thr Asp Thr Lys Phe Arg Asn Tyr Leu He Pro Lys Gly 
370 375 380 

ACA ACC ATA ATG GCA TTA CTG ACT TCC GTG CTA CAT GAT GAC AAA GAA 1200 
Thr Thr He Met Ala Leu Leu Thr Ser Val Leu His Asp Asp Lys Glu 
385 1 390 395 400 

TTT CCT AAT CCA AAT ATC TTT GAC CCT GGC CAC TTT CTA GAT AAG AAT 1248 
Phe Pro Asn Pro Asn He Phe Asp Pro Gly His Phe Leu Asp Lys Asn 
405 410 415 

GGC AAC TTT AAG AAA AGT GAC TAC TTC ATG CCT TTC TCA GCA GGA AAA 12 96 

Gly Asn Phe Lys Lys Ser Asp Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 425 430 

CGA ATT TGT GCA GGA GAA GGA CTT GCC CGC ATG GAG CTA TTT TTA TTT v 1344 
Arg lie Cys Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe 
43 5 44 0 \ 44 5 

CTA ACC ACA ATT TTA CAG AAC TTT AAC CTG AAA TCT GTT GAT GAT TTA 13 92 

Leu Thr Thr He Leu Gin Asn Phe Asn Leu Lys Ser Val Asp Asp Leu 
450 455 460 

AAG AAC CTC AAT ACT ACT GCA GTT ACC AAA GGG ATT GTT TCT CTG CCA 144 0 

Lys Asn Leu Asn Thr Thr Ala Val Thr Lys Gly He Val Ser Leu Pro 
465 470 475 4S0 

CCC TCA TAC CAG ATC TGC TTC ATC CCT GTC TGA 14 73 

Pro Ser Tyr Gin He Cys Phe lie Pro Val 
485 490 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Glu Pro Phe Val Val Leu Val Leu Cys Leu Ser Phe Met Leu Leu 

15 10 15 

Phe Ser Leu Trp Arg Glri Ser Cys Arg Arg Arg Lys Leu Pro Pro Gly 

20 25 30 

Pro Thr Pro Leu Pro lie lie Gly Asn Met Leu Gin lie Asp Val Lys 

35 40 45 

■ Asp lie Cys Lys Ser Phe Thr Asn Phe Ser Lys Val Tyr Gly Pro Val 

50 55 60 

Phe Thr Val Tyr Phe Gly Met Asn Pro He Val Val Phe His Gly Tyr 

65 70 75 80 

Glu Ala Val Lys Glu Ala Leu He Asp Asn Gly Glu Glu Phe Ser Gly 

85 90 95 

Arg Gly Asn Ser Pro He Ser Gin Arg He Thr Lys Gly Leu Gly He 

20 1 00 1 05 110 

He Ser Ser Asn Gly Lys Arg Trp Lys Glu He Arg Arg Phe Ser Leu 

115 120 125 

"■r Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser lie Glu Asp Arg 

25 130 135 140 

Val Gin Glu Glu Ala His Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 

145 150 155 160 



75 



30 



35 



40 



Ala Ser Pro Cys Asp Pro Thr Phe He 1 Leu Gly Cys Ala Pro Cys Asn 

165 170 175 

Val lie Cys Ser Val Val Phe Gin Lys Arg Phe Asp Tyr Lys Asp Gin 
180 185 * 190 

Asn Phe Leu Thr Leu Met Lys Arg Phe Asn Glu Asn Phe Arg lie Leu 
195 200 205 

Asn Ser Pro Trp He Gin Val Cys Asn Asn Phe Pro Leu Leu lie Asp 
210 215 220 

Cys Phe Pro Gly Thr His Asn Lys Val Leu Lys Asn Val Ala Leu Thr 
225 230 235 240 

Arg Ser Tvr lie Arg Glu Lys Val Lys Glu His Gin Ala Ser Leu Asp 

245 250 255 

Val Asn Asn Pro Arg Asp Phe He Asp Cys Phe Leu lie Lys Met Glu 
260 * 265 270 

Gin Glu Lys Asp Asn Gin Lys Ser Glu Phe Asn lie Glu Asn Leu Val 
275 *" 280 285 

Gly Thr Val Ala Asp Leu Phe Val Ala Gly Thr Glu Thr Thr Ser Thr 
290 " 295 300 
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Thr Leu Arg Tyr Gly Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 
305 310 315 320 

Ala Lys Val Gin Glu Glu lie Asd His Val lie Gly Arg His Arg Ser 
325 330 335 

Pro Cys Met Gin Asp Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 
340 345 350 

His Glu lie Gin Arg Tyr Ser Asp Leu Val Pro Thr Gly Val Pro His 
355 360 365 

Ala Val Thr Thr Asp Thr Lys Phe Arg Asn Tyr Leu lie Pro Lys Gly 
370 375 380 

Thr Thr lie Met Ala Leu Leu Thr Ser Val Leu Kis Asp Asp Lys Glu 
385 390 395 " 400 

Phe Pro Asn Pro Asn lie Phe Asp Pro Gly His Phe Leu Asp Lys Asn 
405 410 " 415 

Gly Asn Phe Lyis Lys Ser Asp Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 425 430 

Arg lie Cys Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe 
435 440 445 

Leu Thr Thr lie Leu Gin Asn Phe Asn Leu Lys Ser Val Asp Asp Leu 
450 455 460 

Lys Asn Leu Asn Thr Thr Ala Val Thr Lys Gly lie Val Ser Leu Pro 
465 470 475 480, 

Pro Ser Tyr Gin lie Cys Phe lie Pro Val 
485 490 

(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 
(5) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 1470 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: - 

ATG GAA CCT TTT GTG GTC CTG GTG CTG TGT CTC TCT TTT ATG CTT CTC 4 8 

Met Glu Pro Phe Val Val Leu Val Leu Cys Leu Ser Phe Met Leu Leu 

1 5 10 15 

TTT TCA CTC TGG AGA CAG AGC TGT AGG AG A AGG AAG CTC CCT CCT C-GC 96 

Phe Ser Leu Trp Arg Gin Ser Cys Arg Arg Arc Lys Leu Pro Pro Gly 

20 25 " 30 
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ccc 

Pro 


ACT 
Thr 


CCT 
Pro 
35 


CTT 
Leu 


CCT 
Pro 


ATT 
He 


ATT GGA 
He Gly 
40 


AAT 
Asn 


ATG 
Met 


CTA 
Leu 


CAG 
Gin 


ATA 
He 
45 


GAT 
Asp 


GTT 
Val 


AAG 
Lys 


144 


GAC 
Asp 


ATC 
lie 
50 


TGC 
Cys 


AAA 
Lys 


TCT 
Ser 


TTC 
Phe 


ACC 
Thr 
55 


AAT 
Asn 


TTC 
Phe 


TCA 
Ser 


AAA 
Lys 


GTC 
Val 
60 


TAT 
Tyr 


GGT 
Gly 


CCT 
Pro 


GTG 
Val 


192 


TTC 
Phe 
65 


ACC 
Thr 


GTG 
Val 


TAT 
Tyr 


TTT 
Phe 


GGC ATG 
Gly Met 
70 


AAT 
Asn 


CCC 
Pro 


ATA 
He 


GTG 
Val 
75 


GTG 
Val 


TTT 
Phe 


CAT 
His 


GGA 
Gly 


TAT 
Tyr 
80 


240 


GAG 
Glu 


GCA 
Ala 


GTG 
Val 


AAG 
Lys 


GAA 
Glu 
85 


GCC 
Ala 


CTG 
Leu 


ATT 
lie 


GAT 
Asp 


AAT 
Asn 
90 


GGA 
Gly 


GAG 
Glu 


GAG 
Glu 


TTT 
Phe 


TCT 
Ser 
95 


GGA 
Gly 


288 


AGA 
Arg 


GGC 
Gly 


AAT 
Asn 


TCC 
Ser 
100 


CCA 
Pro 


ATA 
He 


TCT 
Ser 


CAA 
Gin 


AGA 
Arg 
105 


ATT 
He 


ACT 
Thr 


AAA 
Lys 


GGA 
Gly 


CTT 
Leu 
110 


GGA 
Gly 


ATC 
He 


336 


ATT 
He 


TCC 
Ser 


AGC 
Ser 
115 


AAT GGA 
Asn :Gly 


AAG 
Lys 


AGA 
Arg 


TGG 
Trp 
120 


AAG 
Lys 


GAG 
Glu 


ATC 
He 


CGG 
Arg 


CGT 
Arg 
125 


TTC 
Phe 


TCC 
Ser 


CTC 
Leu 


384 


ACA 
Thr 


ACC 
Thr 
130 


TTG 
Leu 


CGG 
Arg 


AAT 
Asn 


TTT GGG ATG 
"Phe Gly Met 
135 


GGG 
Gly 


AAG 
Lys 


AAG 
Lys 


AGC 
Ser 
140 


ATT 
He 


GAG 
Glu 


GAC 
Asp 


CGT 
Arg 


432 


GTT 
Val 
14 5 


CAA 
Gin 


GAG 
Glu 


GAA 
Glu 


GCT 
Ala 


CAC 
His 
150 


TGC 
Cys 


CTT 
Leu 


GTG 
Val 


GAG 
Glu 


GAG 
Glu 
155 


TTG 
Leu 


AGA 
Arg 


AAA 
Lys 


ACC 
Thr 


AAG 
Lys 
160 


480 


GCT 
Ala 


TCA 
Ser 


CCC 
Pro 


TGT 
Cys 


GAT 
Asp 
165 


CCC 
Pro 


ACT 
Thr 


TTC 
Phe 


ATC 
He 


CTG 
Leu 
170 , 


GGC 
Gly 


TGT 
Cys 


GCT 
Ala 


CCC 
Pro 


TGC 
Cys 
175 


AAT 
Asn 


528 


GTG 
Val 


ATC 
He 


TGC 
Cys 


TCC 
Ser 
180 


GTT 
Val 


GTT 
Val 


TTC 
Phe 


CAG 
Gin 


AAA 
Lys 
185 


CGA 
Arg 


TTT 
Phe 


GAT 
Asp 


TAT 
Tyr 


AAA 
Lys 
190 


GAT 
Asp 


CAG 
Gin 


576 


AAT 
Asn 


TTT 
Phe 


CTC 
Leu 
195 


ACC 
Thr 


CTG 
Leu 


ATG 
Met 


AAA 
Lys 


AGA 
Arg 
200 


TTC 
Phe 


AAT 
Asn 


GAA 
Glu 


AAC 
Asn 


TTC 
Phe 
205 


AGG 
Arg 


ATT 
He 


CTG 
Leu 


624 


AAC 
Asn 


TCC 
Ser 
210 


CCA 
Pro 


TGG 
Trp 


ATC 
He 


CAG 
Gin 


GTC 
Val 
215 


TGC 
Cys 


AAT 
Asn 


AAT 
Asn 


TTC 
Phe 


CCT 
Pro 
220 


CTA 
Leu 


CTC 
Leu 


ATT 
He 


GAT 
Asp 


672 


TGT 
Cys 
225 


TTC 
Phe 


CCA 
Pro 


GGA 
Gly 


ACT 
Thr 


CAC 
His 
230 


AAC 
Asn 


AAA 
Lys 


GTG 
Val 


CTT 
Leu 


AAA 
Lys 
235 


AAT 
Asn 


GTT 
Val 


GCT 
Ala 


CTT 
Leu 


ACA 
Thr 
240 


720 


CGA 
Arg 


AGT 
Ser 


TAC 
Tyr 


ATT 
He 


AGG 
Arg 
245 


GAG 
Glu 


AAA 
Lys 


GTA 
Val 


AAA 
Lys 


GAA 
Glu 
250 


CAC 
His 


CAA 
Gin 


GCA 
Ala 


TCA 
Ser 


CTG 
Leu 
255 


GAT 
Asp 


768 
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GTT 
val 


AAC 
Asn 


AAT 
Asn 


CCT 
Pro 
260 


CGG 
Arg 


GAC 
Asp 


TTT 
Phe 


ATC 
He 


GAT 
Aso 
265 


TGC 
Cys 


TTC 
Phe 


CTG 
Leu 


ATC 
He 


AAA 
Lys 
270 


ATG 
Met 


GAG 
Glu 


316 


5 


CAG 
Gin 


GAA 
Glu 


AAG 
Lys 
275 


GAC 
Asp 


AAC 
Asn 


CAA 
Gin 


AAG 
Lys 


TCA 
Ser 
280 


GAA 
Glu 


TTC 
Phe 


AAT 
Asn 


ATT 
He 


GAA 
Glu 
285 


AAC 
Asn 


TTG 
Leu 


GTT 
Val 


364 


JO 


GGC 
Gly 


ACT 
Thr 


GTA 
Val 


GCT 
Ala 


GAT 
Asp 


CTA 
Leu 


TTT 
Phe 
295 


GTT 
Val 


GCT 
Ala 


GGA ACA 
Gly Thr 


GAG 
Glu 
300 


ACA 
Thr 


ACA 
Thr 


AGC 
Ser 


ACC 
Thr 


912 




• ACT 
Thr 


CTG 
Leu 


AGA 
Arg 


TAT 
Tyr 


GGA 
Gly 


CTC 
Leu 
310 


CTG 
Leu 


CTC 
Leu 


CTG 
Leu 


CTG 
Leu 


AAG 
Lys 
315 


CAC 
His 


CCA 
Pro 


GAG 
Glu 


GTC 
Val 


ACA 
Thr 
320 


960 


75 


GCT 
Ala 


AAA 
Lys 


GTC 
Val 


CAG 
Gin 


GAA 
Glu 
32 5 


GAG 
Glu 


ATT 
lie 


GAT 
Asp 


CAT 
His 


GTA 
Val 
330 


ATT 
lie 


GGC AGA CAC AGG 
Gly Arg His Arg 
335 


AGC 
Ser 


1008 


20 


ccc 

Pro 


TGC 
Cys 


ATG 
Met 


CAG 
Gin 
3 4b 


GAT 
Asp 


AGG 
Arg 


AGC 
Ser 


CAC 
His 


ATG 
Met 
345 


CCT 
Pro 


TAC 
Tyr 


ACT GAT GCT 
Thr Asp Ala 
350 


GTA 
Val 


GTG 
Val 


1056 


CAC 
His 


GAG 
Glu 


ATC 
He 
j j j 


CAG 
Gin 


AGA 
Arg 


TAC 
Tyr 


AGT 
Ser 


GAC 
Asp 
3 60 


CTT 
Leu 


GTC 
Val 


CCC 
Pro 


ACC GGT GTG 
Thr Gly Val 
365 


CCC 
Pro 


CAT 
His 


1104 


25 


GCA 
Ala 


GTG 
Val 
o / u 


ACC 
Thr 


ACT 
Thr 


GAT 
Asp 


ACT 
Thr 


AAG 
Lys 
375 


TTC 
Phe 


AGA 
Arg 


AAC 
Asn 


TAC 
Tyr 


CTC 
Leu 
380 


ATC 
He 


CCC 
Pro 


AAG 
Lys 


GGC 
Gly 


1152 




ACA 
Thr 
385 


ACC 
Thr 


ATA 
lie 


ATG 
Met 


GCA 
Ala 


TTA 
Leu 

1 Q ft 

J J V 


CTG 
Leu 


ACT 
Thr 


TCC 
Ser 


GTG 
Val 


CTA 
Leu 
395 


CAT 
His 


GAT GAC 
Asp Asp 


AGA 
Arg 


GAA 
Glu 
400 


1200 


30 


TTT 
Phe 


CCT 
Pro 


AAT 
Asn 


CCA 
Pro 


AAT 
Asn 
*± v 0 


ATC 
He 


TTT 
Phe 


GAC 
Asp 


CCT 
Pro 


GGC CAC 
Gly His 
410 


TTT 
Phe 


CTA 
Leu 


GAT 
Asp 


AAG 
Lys 
415 


AAT 
Asn 


1248 


35 


GGC AAC 
Gly Asn 


TTT 
Phe 


AAG 
Lys 

/OA 
*i £, U 


AAA 
Lys 


AGT 
Ser 


GAC 
Asp 


TAC 
Tyr 


TTC 
Phe 
4 2 5 


ATG 
Met 


CCT 
Pro 


TTC 
Phe 


TCA 
Ser 


GCA 
Ala 
430 


GGA 
Gly 


AAA 
Lys 


1296 




CGA 
Arg 


ATT 
lie 


TGT 
Cys 

4 J J 


GCA 
Ala 


GGA 
Gly 


GAA 
Glu 


GGA 
Gly 


CTT 
Leu 
44 0 


GCC 
Ala 


CGC 
Arg 


ATG 
Met 


GAG 
Glu 


CTA 
Leu 
445 


TTT 
Phe 


TTA 
Leu 


TTT 
Phe 


1344 


40 


CTA 
Leu 


ACC 
Thr 
450 


ACA 
Thr 


ATT 
lie 


TTA 
Leu 


CAG 
Gin 


AAC 
Asn 
455 


TTT 
Phe 


AAC 
Asn 


CTG 
Leu 


AAA 
Lys 


TCT 
Ser 
460 


GTT 
Val 


GAT 
Asp 


GAT 
Asp 


TTA 
Leu 


1392 


45 


AAG 
Lys 
465 


AAC 
Asn 


CTC 
Leu 


AAT 
Asn 


ACT 
Thr 


ACT 
Thr 
470 


GCA 
Ala 


GTT 
Val 


ACC 
Thr 


AAA 
Lys 


GGG 
Gly 
475 


ATT 
He 


GTT 
Val 


TCT 
Ser 


CTG 
Leu 


CCA 
Pro 
430 


1440 




CCC 
Pro 


TCA 
Ser 


TAC 
Tyr 


CAG 
Gin 


ATC 
He 
485 


TGC 
Cys 


TTC 
Phe 


ATC 
He 


CCT 
Pro 


GTC 
Val 
490 


TGA 












1473 



50 



55 



67 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

70 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Glu Pro Phe Val Val Leu Val Leu Cys Leu Ser Phe Met Leu Leu 
15 10 15 

Phe Ser Leu Trp Arg Gin Ser Cys Arg Arg Arg Lys Leu Pro Pro Gly 
75 20 25 30 

Pro Thr Pro Leu Pro lie lie Gly Asn Met Leu Gin lie Asp Val Lys 
35 40 45 



20 



30 



35 



45 



Asp lie Cys L^s Ser Phe Thr Asn Phe Ser Lys Val Tyr Gly Pro Val 

50 55 60 

L-he Thr Val Tyr Phe Gly Met Asn Pro He Val Val Phe His Gly Tyr 

65 70 75 80 

25 Glu Ala Val Lys Glu Ala Leu He Asp Asn Gly Glu Glu Phe Ser Gly 

85 90 95 

Arg Gly Asn Ser Pro He Ser Gin Arg He Thr Lys Gly Leu Gly'^Ile 

100 105 110 



50 



He Ser Ser Asn Gly Lys Arg Trp Lys Glu He Arg Arg Phe Ser Leu 
115 120 125 

Thr Thr Leu Arg Asn Phe Gly Met Gly Lys Lys Ser lie Glu Asp Arg 
130 135 140 

Val Gin Glu Glu Ala His Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 
145 150 155 160 

Ala Ser Pro Cys Asp Pro Thr Phe He Leu Gly Cys Ala Pro Cys Asn 
165 170 175 

Val He Cys Ser Val Val Phe Gin Lys Arg Phe Asp Tyr Lys Asp Gin 
180 185 190 

Asn Phe Leu Thr Leu Met Lys Arg Phe Asn Glu Asn Phe Arg lie Leu 
195 200 205 

Asn Ser Pro Trp He Gin Val Cys Asn Asn Phe Pro Leu Leu lie Asp 
210 215 220 

Cys Phe Pro Gly Thr His Asn Lys Val Leu Lys Asn Val Ala Leu Thr 
225 230 235 240 
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15 



Arg Ser Tyr lie Arg Glu Lys Val Lys Glu His Gin Ala Ser Leu Asp 
245 250 255 

Val Asn Asn Pro Arg Asp Phe He Asp Cys Phe Leu He Lys Met Glu 
260.- 265 270 

Gin Glu Lys Asp Asn Gin Lys Ser Glu Phe Asn He Glu Asn Leu Val 
275 280 285 

Gly Thr Val Ala Aso Leu Phe Val Ala Gly Thr Glu Thr Thr Ser Thr 
290 * 295 300 

Thr Leu Arg Tyr Gly Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 
305 310 315 320 

Ala Lys Val Gin Glu Glu He Asp His Val He Gly Arg His Arg Ser 
325 330 335 

Pro Cys MetiGln Asp Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 
: 340 345 350 

His Glu He Gin. Arg Tyr Ser Asp Leu Val Pro Thr Gly Val Pro His 
355 360 365 

Ala Val Thr Thr Asp Thr Lys Phe Arg Asn Tyr Leu He Pro Lys Gly 
370 375 380 

Thr Thr He Met Ala Leu Leu Thr Ser Val Leu His Asp Asp Arg Glu 
385 390 395 ■ 400 

Phe Pro Asn Pro Asn He Phe Asp Pro Gly His Phe Leu Asp Lys Asn 
30 405 410 415 

Gly Asn Phe Lys Lys Ser Asp Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 425 430 



20 



25 



35 



45 



50 



Arg He Cys Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe 
435 440 445 

Leu Thr Thr lie Leu Gin Asn Phe Asn Leu Lys Ser Val Asp Asp Leu 
450 455 460 

Lys Asn Leu Asn Thr Thr Ala Val Thr Lys Gly lie Val Ser Leu Pro 
465 470 475 480 

Pro Ser Tyr Gin He Cys Phe lie Pro Val 
485 490 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



55 
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(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..1470 



txi) SEQUENCE DESCRIPTION : SEQ ID NO: : 

ATG GAA CCT TTT GTG GTC CTG GTG CTG TGT CTC TCT TTT ATG CTT CTC 
Met Glu Pro Phe Val Val Leu Val Leu Cys Leu Ser Phe Met Leu Leu 
1 S 10 15 

TTT TCA CTC TGG AGA CAG AGC TGT AGG AGA AGG AAG CTC CCT CCT GGC 
Phe Ser Leu Trp Arg Gin Ser Cys Arg Arg Arg Lys Leu Pro Pro Gly 
20 25 30 

CCC ACT CCT CTT CCT ATT ATT GGA AAT ATG CTA CAG ATA GAT GTT AAG 
Pro Thr Pro Leu Pro He He Gly Asn Met Leu Gin He Asp Val Lys 
35 40 45 

GAC ATC TGC AAA TCT TTC ACC AAT TTC TCA AAA GTC TAT GGT CCT GTG 
Asp He Cys Lys Ser Phe Thr Asn Phe Ser Lys Val Tyr Gly Pro Val 
50 \ 55 60 

TTC ACC GTG TAT TTT GGC ATG AAT CCC ATA GTG GTG TTT CAT GGA TAT 
Phe Thr Val Tyr Phe Gly Met Asn Pro lie Val Val Phe His Gly Tyr 
65 70 75 80 

GTG GCA GTG AAG GAA GCC CTG ATT GAT AAT GGA GAG GAG TTT TCT GGA 
Val Ala Val Lys Glu Ala Leu lie Asp Asn Gly Glu Glu Phe Ser Gly 
85 90 95 

AGA GGC AAT TCC CCA ATA TCT CAA AGA ATT ACT AAA GGA CTT GGA ATC 
Arg Gly Asn Ser Pro lie Ser Gin Arg He Thr Lys Gly Leu Gly He 
100 105 HO 

ATT TCC AGC AAT GGA AAG AGA TGG AAG GAG ATC CGG CGT TTC TCC CTC 
lie Ser Ser Asn Gly Lys Arg Trp Lys Glu He Arg Arg Phe Ser Leu 
115 120 125 

ACA ACC TTG CGG AAT TTT GGG ATG GGG AAG AAG AGC ATT GAG GAC CGT _ 
Thr Thr Leu Arg Asn Phe Gly Met Gly Lys Lys Ser He Glu Asp Arg " 
130 135 140 

GTT CAA GAG GAA GCT CAC TGC CTT GTG GAG GAG TTG AGA AAA ACC AAG 
Val Gin Glu Glu Ala His Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 
145 150 155 160 

GCT TCA CCC TGT GAT CCC ACT TTC ATC CTG GGC TGT GCT CCC TGC AAT 
Ala Ser Pro Cvs Asp Pro Thr Phe lie Leu Gly Cys Ala Pro Cys Asr. 

165 170 175 

GTG ATC TGC TCC GTT GTT TTC CAG AAA CGA TTT GAT TAT AAA GAT CAG 
Val lie Cys Ser Val Val Phe Gin Lys Arg Phe Asp Tyr Lys Asp Gin 
180 185 190 

AAT TTT CTC ACC CTG ATG AAA AGA TTC AAT GAA AAC TTC AGG ATT CTG 
Asn Phe Leu Thr Leu Met Lys Arg Phe Asn Glu Asn Phe Arg He Leu 
195 200 205 
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25 



AAC TCC CCA TGG ATC CAG GTC TGC AAT AAT TTC CCT CTA CTC ATT GAT 672 

Asn Ser Pro Trp He Gin Val Cys Asn Asn Phe Pro Leu Leu He Asp 

210 215 220 

5 TGT TTC CCA GGA ACT ^ CAC AAC AAA GTG CTT AAT GTT GCT CTT ACA 720 

Cys Phe Pro Gly Thr\ His Asn Lys Val Leu Lys Asn Val Ala Leu Thr 
225 230 235 240 

CGA AGT TAC ATT AGG GAG AAA GTA AAA GAA CAC CAA GCA TCA CTG GAT 768 

Arg Ser Tyr He Arg Glu Lys Val Lys Glu His Gin Ala Ser Leu Asp 
10 245 250 255 

GTT AAC AAT CCT CGG GAC TTT ATC GAT TGC TTC CTG ATC AAA ATG GAG 816 

Val Asn Asn Pro Arg Asp Phe He Asp Cys Phe Leu He Lys Met Glu 
260 265 270 

75 CAG GAA AAG GAC AAC CAA AAG TCA GAA TTC AAT ATT GAA AAC TTG GTT 864 

Gin Glu Lys Asp Asn Gin Lys Ser Glu Phe Asn He Glu Asn Leu Val 

275 280 285 

GGC ACT GTA GCT GAT CTA TTT GTT GCT GGA ACA GAG ACA ACA AGC ACC 912 

Gly Thr Val Alalt Asp Leu Phe Val Ala Gly Thr Glu Thr Thr Ser Thr 

20 2 9 0 2 9 5 3 0 0 

ACT CTG AGA TAT GGA CTC CTG CTC CTG CTG AAG CAC CCA GAG GTC ACA 960 

Thr Leu Arg Tyr Gly Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 
305 310 315 320 

GCT AAA GTC CAG GAA GAG ATT GAT CAT GTA ATT GGC AGA CAC AGG AGC 1008 

Ala Lys Val Gin Glu Glu lie Asp His Val lie Gly Arg His Arg Ser 
325 330 335 

CCC TGC ATG CAG GAT AGG AGC CAC ATG CCT TAC ACT GAT GCT GTA GTG 1056 

Pro Cys Met Gin Asp Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 
340 345 1 350 

CAC GAG ATC CAG AGA TAC AGT GAC CTT GTC CCC ACC GGT GTG CCC CAT 1104 

His Glu lie Gin Arg Tyr Ser Asp Leu Val Pro Thr Gly Val Pro His 

355 360 365 

GCA GTG ACC ACT GAT ACT AAG TTC AGA AAC TAC CTC ATC CCC AAG GGC ' 1152 

Ala Val Thr Thr Asp Thr Lys Phe Arg Asn Tyr Leu He Pro Lys Gly 

370 375 380 

ACA ACC ATA ATG GCA TTA CTG ACT TCC GTG CTA CAT GAT GAC AGA GAA 1200 

Thr Thr lie Met Ala Leu Leu Thr Ser Val Leu His Asp Asp Arg Glu 
385 390 395 " 400 

TTT CCT AAT CCA AAT ATC TTT GAC CCT GGC CAC TTT CTA GAT AAG AAT 124 8 

Phe Pro Asn Pro Asn He Phe Asp Pro Gly His Phe Leu Asp Lys Asn 
405 * 410 415 

GGC AAC TTT AAG AAA AGT GAC TAC TTC ATG CCT TTC TCA GCA GGA AAA 12 96 

45 Gly Asn Phe Lys Lys Ser Asp Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 " " 425 430 

CGA ATT TGT GCA GGA GAA GGA CTT GCC CGC ATG GAG CTA TTT TTA TTT 1344 

Arg He Cys Ala Gly Glu Glv Leu Ala Arg Met Glu Leu Phe Leu Phe 

435 440 445 

50 



30 



35 



40 



55 



71 



EP 0 644 267 A2 



CTA ACC ACA ATT TTA CAG AAC TTT AAC CTG AAA TCT GTT GAT GAT TTA 
Leu Thr Thr lie Leu Gin Asn Phe Asn Leu Lys Ser Val Asp Asp Leu 
450 455 460 

AAG AAC CTC AAT ACT ACT GCA GTT ACC AAA GGG ATT GTT TCT CTG CCA 
Lys Asn Leu Asn Thr Thr Ala Val Thr Lys Gly He Val Ser Leu Pro 
465 470 475 4S0 

CCC TCA TAC CAG ATC TGC TTC ATC CCT GTC TGA 
Pro Ser Tyr Gin He Cys Phe He Pro Val 
485 490 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Glu Pro Phe Val Val Leu Val Leu Cys Leu Ser Phe Met Leu Leu 
1 5 10 15 

Phe Ser Leu Trp Arg Gin Ser Cys Arg Arg Arg Lys Leu Pro Pro Gly 
20 25 30 

Pro Thr Pro Leu Pro He He Gly Asn Met Leu Gin lie Asp Val Lys 
35 40 45 

Asp lie Cys Lys Ser Phe Thr Asn Phe Ser Lys Val Tyr Gly Pro Val 
50 55 ; 60 

Phe Thr Val Tyr Phe Gly Met Asn Pro lie Val Val Phe His Gly Tyr 
65 70 75 80 

Val Ala Val Lys Glu Ala Leu lie Asp Asn Gly Glu Glu Phe Ser Gly 
85 90 95 

Arg Gly Asn Ser Pro He Ser Gin Arg He Thr Lys Gly Leu Gly lie 
100 105 110 

He Ser Ser Asn Gly Lys Arg Trp Lys Glu He Arg Arg Phe Ser Leu 
115 120 125 

Thr Thr Leu Arg Asn Phe Gly Met Gly Lys Lys Ser He Glu Asp Arg 
130 135 140 

Val Gin Glu Glu Ala His Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 
145 150 155 160 

Ala Ser Pro Cys Aso Pro Thr Phe He Leu Gly Cys Ala Pro Cys Asn 
165 170 175 
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Val He Cys Ser Val Val Phe Gin Lys Arg Phe Asp Tyr Lys Asp Gin 
180 185 190 

Asn Phe Leu Thr Leu Met Lys Arg Phe Asn Glu Asn Phe Arg He Leu 
195 200 205 

Asn Ser Pro Trp He Gin Val Cys Asn Asn Phe Pro Leu Leu He Asp 
210 215 220 

Cys Phe Pro Gly Thr His Asn Lys Val Leu Lys Asn Val Ala Leu Thr 
225 230 235 240 

Arg Ser Tyr He Arg Glu Lys Val Lys Glu His Gin Ala Ser Leu Asp 
245 250 255 

Val Asn Asn Pro Arg Asp Phe He Asp Cys Phe Leu lie Lys Met Glu 
260 265 270 

Gin Glu Lys Asp Asn Gin Lys Ser Glu Phe Asn He Glu Asn Leu Val 
275 280 285 

Gly Thr Val Ala Asp Leu Phe Val Ala Gly Thr Glu Thr Thr Ser Thr 
290 295 300 

Thr Leu Arg Tyr Gly Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 
305 310 315 320 

Ala Lys Val Gin Glu Glu He Asp His Val He Gly Arg His Arg Ser 
325 330 335 

Pro Cys Met Gin Aso Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 
340 " 345 350 

His Glu He Gin Arg Tyr Ser Asp Leu Val Pro Thr Gly Val Pro His 
355 360 365 

Ala Val Thr Thr Asp Thr Lys Phe Arg Asn Tyr Leu lie Pro Lys Gly 
370 375 380 

Thr Thr lie Met Ala Leu Leu Thr Ser Val Leu His Asp Asp Arg Glu 
385 390 395 400 

Phe Pro Asn Pro Asn He Phe Asp Pro Gly His Phe Leu Asp Lys Asn 
405 410 415 

Gly Asn Phe Lys Lys Ser Asp Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 425 430 

Arg lie Cys Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe 
435 440 445 

Leu Thr Thr He Leu Gin Asn Phe Asn Leu Lys Ser Val Asp Asp Leu 
450 455 460 

Lys Asn Leu Asn Thr Thr Ala Val Thr Lys Gly lie Val Ser Leu Pro 
465 470 475 480 

Pro Ser Tvr Gin lie Cys Phe lie Pro Val 
485 490 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 
(3) TYPE: nucleic acid 
5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
>0 (B) LOCATION: 1..1470 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG GAT CCA GCT GTG GCT CTG GTG CTC TGT CTC TCC TGT TTG TTT CTC 48 
75 Met Asp Pro Ala Val Ala Leu Val Leu Cys Leu Ser Cys Leu Phe Leu 
15 10 15 

CTT TCA CTC TGG AGG CAG AGC TCT GGA AG A GGG AGG CTC CCG TCT GGC 96 

Leu Ser Leu Trp Arg Gin Ser Ser Gly Arg Gly Arg Leu Pro Ser Gly 

20 "* 25 30 

20 

CCC ACT CCT CTC CCG ATT ATT GGA AAT ATC CTG CAG TTA GAT GTT AAG 144 

Pro Thr Pro Leu Pro lie lie Gly Asn He Leu Gin Leu Asp Val Lys 

35 40 45 

GAC ATG AGC AAA TCC TTA ACC AAT TTC TCA AAA GTC TAT GGC CCT GTG 192 
25 Asp Met Ser Lys Ser Leu Thr Asn Phe Ser Lys Val Tyr Gly Pro Val 
50 55 60 

TTC ACT GTG TAT TTT GGC CTG AAG CCC ATT GTG GTG TTG CAT GGA TAT v 240 
Phe Thr Val Tyr Phe Gly Leu Lys Pro He Val Val Leu His Gly Tyr 
65 70 • 75 80 



30 



35 



40 



45 



GAA GCA GTG AAG GAG GCC CTG ATT GAT CAT GGA GAG GAG TTT TCT GGA 288 

Glu Ala Val Lys Glu Ala Leu He Asp His Gly Glu Glu Phe Ser Gly 
85 90 95 

AGA GGA AGT TTT CCA GTG GCT GAA AAA GTT AAC AAA GGA CTT GGA ATC 3 36 

Arg Gly Ser Phe Pro Val Ala Glu Lys Val Asn Lys Gly Leu Gly He 
100 105 110 

CTT TTC AGC AAT GGA AAG AGA TGG AAG GAG ATC CGG CGT TTC TGC CTC 3 84 

Leu Phe Ser Asn Gly Lys Arg Trp Lys Glu He Arg Arg Phe Cys Leu 
115 120 125 

ATG ACT CTG CGG AAT TTT GGG ATG GGG AAG AGG AGC ATC GAG GAC CGT 4 32 

Met Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser lie Glu Asp Arg 

130 135 140 

GTT CAA GAG GAA GCC CGC TGC CTT GTG GAG GAG TTG AGA AAA ACC AAT 4 80 

Val Gin Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Asn 
145 150 155 160 



50 
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GCC TCA CCC TGT GAT CCC ACT TTC ATC CTG GGC TGT GCT CCC TGC AAT 528 
Ala Ser Pro Cys Asp Pro Thr Phs lie Leu Gly Cys Ala Pro Cys Asn 
165 170 175 

5 GTG ATC TGC TCT GTT ATT TTC CAT GAT CGA TTT GAT TAT AAA GAT CAG 576 
Val He Cys Ser VaPlle Phe His Asp Arg Phe Asp Tyr Lys Asp Gin 
180 ' 185 " 190 

AGG TTT CTT AAC TTG ATG GAA AAA TTC AAT GAA AAC CTC AGG ATT CTG 624 
Arg Phe Leu Asn Leu Met Glu Lys Phe Asn Glu Asn Leu Arg He Leu 
10 195 200 205 

AGC TCT CCA TGG ATC CAG GTC TGC AAT AAT TTC CCT GCT CTC ATC GAT 672 
Ser Ser Pro Trp He Gin Val Cys Asn Asn Phe Pro Ala Leu He Asd 
210 215 220 

7 5 TAT CTC CCA GGA AGT CAT AAT AAA ATA GCT GAA AAT TTT GCT TAC ATT 720 
Tyr Leu Pro Gly Ser His Asn Lys He Ala Glu Asn Phe Ala Tyr He 
225 230 235 240 

AAA AGT TAT GTA TTG GAG AGA ATA AAA GAA CAT CAA GAA TCC CTG GAC 768 
Lys Ser Tyr VaJfc Leu Glu Arg lie Lys Glu His Gin Glu Ser Leu Aso 
20 245 250 255 

ATG AAC AGT GCT CGG GAC TTT ATT GAT TGT TTC CTG ATC AAA ATG GAA 816 
Met Asn Ser Ala Arg Asp Phe He Asp Cys Phe Leu He Lys Met Glu 
260 265 270 



25 



30 



35 



CAG GAA AAG CAC AAT CAA CAG TCT GAA TTT ACT GTT GAA AGC TTG ATA 864 
Gin Glu Lys His Asn Gin Gin Ser Glu Phe Thr Val Glu Ser Leu He 
275 280 285 

GCC ACT GTA ACT GAT ATG TTT GGG GCT GGA ACA GAG ACA ACG AGC ACC '* 912 
Ala Thr Val Thr Asp Met Phe Gly Ala Gly Thr Glu Thr Thr Ser Thr 
290 295 1 300 

ACT CTG AGA TAT GGA CTC CTG CTC CTG CTG AAG TAC CCA GAG GTC ACA 960 
Thr Leu Arg Tyr Gly Leu Leu Leu Leu Leu Lys Tyr Pro Glu Val Thr 
305 310 315 320 

GCT AAA GTC CAG GAA GAG ATT GAA TGT GTA GTT GGC AGA AAC CGG AGC "- 1008 
Ala Lys Val Gin Glu Glu He Glu Cys Val Val Gly Arg Asn Arg Ser 
325 330 335 

CCC TGT ATG CAG GAC AGG AGT CAC ATG CCC TAC ACA GAT GCT GTG GTG 1056 
Pro Cys Met Gin Asp Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 
340 345 ' 350 

CAC GAG ATC CAG AGA TAC ATT GAC CTC CTC CCC ACC AAC CTG CCC CAT 1104 
His Glu He Gin Arg Tyr He Asp Leu Leu Pro Thr Asn Leu Pro His 
355 360 365 

GCA GTG ACC TGT GAT GTT AAA TTC AAA AAC TAC CTC ATC CCC AAG GGC 1152 
Ala Val Thr Cys Asp Val Lys Phe Lys Asn Tyr Leu He Pro Lys Gly 
370 375 380 

ACG ACC ATA ATA ACA TCC CTG ACT TCT GTG CTG CAC AAT GAC AAA GAA 12 00 

Thr Thr He He Thr Ser Leu Thr Ser Val Leu His Asn Asp Lys Glu 
385 390 395 400 
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15 



TTC CCC AAC CCA GAG ATG TTT GAC CCT GGC CAC TTT CTG GAT AAG AGT 1248 
Phe Pro Asn Pro Glu Met Phe Asp Pro Gly Kis Phe Leu Asp Lys Ser 
405 410 415 

GGC AAC TTT AAG AAA AGT GAC TAC TTC ATG CCT TTC TCA GCA GGA AAA 1296 
Gly Asn Phe Lys Lys Ser Asp Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 * 425 430 

CGG ATG TGT ATG GGA GAG GGC CTG GCC CGC ATG GAG CTG TTT TTA TTC 134 4 

Arg Met Cys Met Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe 
435 440 445 

CTG ACC ACC ATT TTG CAG AAC TTT AAC CTG AAA TCT CAG GTT GAC CCA 1392 
Leu Thr Thr He Leu Gin Asn Phe Asn Leu Lys Ser Gin Val Asp Pro 
450 455 460 

AAG GAT ATT GAC ATC ACC CCC ATT GCC AAT GCA TTT GGT CGT GTG CCA 1440 
Lys Asp He Asp He Thr Pro He Ala Asn Ala Phe Gly Arg Val Pro 
465 470 475 480 

CCC TTG TAC CAG CTC TGC TTC ATT CCT GTC TGA 1473 
Pro Leu Tyr Gin i^eu Cys Phe He Pro Val 
485 490 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 90 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO': 28: 



20 



30 



35 



40 



Met Asp Pro Ala Val Ala Leu Val Leu Cys Leu Ser Cys Leu Phe Leu 
15 10 15 

Leu Ser Leu Trp Arg Gin Ser Ser Gly Arg Gly Arg Leu Pro Ser Gly 
20 25 30 

Pro Thr Pro Leu Pro He He Gly Asn He Leu Gin Leu Asp Val Lys 
35 40 45 

Asp Met Ser Lys Ser Leu Thr Asn Phe Ser Lys Val Tyr Gly Pro Val 
50 * 55 60 

Phe Thr Val Tyr Phe Gly Leu Lys Pro lie Val Val Leu His Gly Tyr 
65 70 75 80 

Glu Ala Val Lys Glu Ala Leu He Asp His Gly Glu Glu Phe Ser Gly 
85 90 95 

Arg Gly Ser Phe Pro Val Ala Glu Lys Val Asn Lys Gly Leu Gly He 
100 105 110 
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Leu ?he Ser Asn Gly Lys Arg Trp Lys Glu He Arg Arg Phe Cys Leu 
125 120 125 

Met Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser He Glu Asp Arg 
130 135 140 

Val Gin Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Asn 
145 150 155 160 

Ala Ser Pro Cys Asp Pro Thr Phe He Leu Gly Cys Ala Pro Cys Asn 
155 170 175 

Val He Cys Ser Val He Phe His Asp Arg Phe Asp Tyr Lys Asp Gin 
180 185 190 

Arg Phe Leu Asn Leu Met Glu Lys Phe Asn Glu Asn Leu Arg He Leu 
195 200 205 

Ser Ser Pro Trp He Gin Val Cys Asn Asn Phe Pro Ala Leu He Asp 
210 . 215 220 

Tyr Leu Pro Gly Ser His Asn Lys He Ala Glu Asn Phe Ala Tyr He 
225 230 235 240 

Lys Ser Tyr Val Leu Glu Arg He Lys Glu His Gin Glu Ser Leu Asp 
245 250 255 

Met Asn Ser Ala Arg Asp Phe He Asp Cys Phe Leu He Lys Met Glu 
260 265 270 

Gin Glu Lys His Asn Gin Gin Ser Glu Phe Thr Val Glu Ser Leu. lie 
275 280 285 

Ala Thr Val Thr Asp Met Phe Gly Ala Gly Thr Glu Thr Thr Ser Thr 
290 295 300 

Thr Leu Arg Tyr Gly Leu Leu Leu Leu Leu Lys Tyr Pro Glu Val Thr 
305 310 315 320 

Ala Lys Val Gin Glu Glu He Glu Cys Val Val Gly Arg Asn Arg Ser 
325 330 335 

Pro Cys Met Gin Asp Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 
340 345 * 350 

His Glu lie Gin Arg Tvr lie hsp Leu Leu Pro Thr Asn Leu Pro His 
355 ' 360 365 

Ala Val Thr Cys Asp Val Lys Phe Lys Asn Tyr Leu He Pro Lys Gly 
370 " 375 380 

Thr Thr lie lie Thr Ser Leu Thr Ser Val Leu His Asn Asp Lys Glu 
385 3S0 395 400 

Phe Pre Asn Pro Glu Met Phe Asp Pro Gly His Phe Leu Asp Lys Ser 
405 * 410 415 

Gly Asn Phe Lys Lys Ser Asp Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 " 425 430 
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Arg Met Cys Met Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe 
435 440 445 

Leu Thr Thr lie Leu Gin Asn Phe Asn Leu Lys Ser Gin Val Asp Pro 
450 455 460 

Lys Asp lie Asp lie Thr Pro lie Ala Asn Ala Phe Gly Arg Val Pro 
465 470 475 480 

Pro Leu Tyr Gin Leu Cys Phe lie Pro Val 
485 490 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE :\ 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1470 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATG GAT CCT TTT GTG GTC CTT GTG CTC TGT CTC TCA TGT TTG CTT CTC 
Met Asp Pro Phe Val Val Leu Val Leu Cys Leu Ser Cys Leu Leu Leu 
15 10 15 

CTT TCA CTC TGG AG A CAG AGC TCT GGG AGA GGA AAA CTC CCT CCT GGC 
Leu Ser Leu Trp Arg Gin Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly 
20 25 ! 30 

CCC ACT CCT CTC CCA GTG ATT GGA AAT ATC CTA CAG ATA GAT ATT AAG 
Pro Thr Pro Leu Pro Val He Gly Asn He Leu Gin He Asp He Lys 
35 40 45 

GAT GTC AGC AAA TCC TTA ACC AAT CTC TCA AAA ATC TAT GGC CCT GTG 
Asp Val Ser Lys Ser Leu Thr Asn Leu Ser Lys He Tyr Gly Pro Val 
50 55 60 

TTC ACT CTG TAT TTT GGC CTC GAG CGC ATG GTG GTG CTG CAT GGA TAT 
Phe Thr Leu Tyr Phe Gly Leu Glu Arg Met Val Val Leu His Gly Tyr 
65 70 7S 80 

GAA GTG GTG AAG GAA GCC CTG ATT GAT CTT GGA GAG GAG TTT TCT GGA 
Glu Val Val Lys Glu Ala Leu He Asp Leu Gly Glu Glu Phe Ser Gly 
85 90 95 

AGA GGC CAT TTC CCA CTG GCT GAA AGA GCT AAC AGA GGA TTT GGA ATC 
Arg Gly His Phe Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly He 
100 105 110 
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GTT TTC AGC AAT GGA AAG AGA TGG AAG GAG ATC CGG CGT TTC TCC CTC 3 84 

Val Phe Ser Asn Gly Lys Arg Trp Lys Glu lie Arg Arg Phe Ser Leu 
L15 120 125 

ATG ACG CTG CGG AAT TTT GGG ATG GGG AAG AGG AGC ATT GAG GAC CGT 432 
Met Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser lie Glu Asp Arg 
130 «^ 135 140 

GTT CAA GAG GAA GCC CGC TGC CTT GTG GAG GAG TTG AGA AAA ACC AAG 480 
Val Gin Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 
145 150 155 160 

GCT TCA CCC TGT GAT CCC ACT TTC ATC CTG GGC TGT GCT CCC TGC AAT 528 
Ala Ser Pro Cys Asp Pro Thr Phe lie Leu Gly Cys Ala Pro Cys Asn 
165 170 175 

GTG ATC TGC TCC ATT ATT TTC CAG AAA CGT TTC GAT TAT AAA GAT CAG 576 
/5 Val lie Cys Ser lie lie Phe Gin Lys Arg Phe Asp Tyr Lys Asp Gin 
180 185 190 

CAA TTT CTT AAC TTG ATG GAA AAA TTG AAT GAA AAC ATC AGG ATT GTA 624 
Gin Phe Leu Asn Leu Met Glu Lys Leu Asn Glu Asn lie Arg lie Val 
195 * ; 200 205 



10 



20 



25 



30 



35 



AGC ACC CCC TGG ATC CAG ATA TGC AAT AAT TTT CCC ACT ATC ATT GAT 672 

Ser Thr Pro Trp lie Gin lie Cys Asn Asn Phe Pro Thr lie lie Asp 

210 215 220 

TAT TTC CCG GGA ACC CAT AAC AAA TTA CTT AAA AAC CTT GCT TTT ATG 72 0 

Tyr Phe Pro Gly Thr His Asn Lys Leu Leu Lys Asn Leu Ala Phe Met 

225 230 235 240 

GAA AGT GAT ATT TTG GAG AAA GTA AAA GAA CAC CAA GAA TCG ATG GAC 768 

Glu Ser Asp lie Leu Glu Lys Val Lys Glu His Gin Glu Ser Met Asp 
245 250 255 

ATC AAC AAC CCT CGG GAC TTT ATT GAT TGC TTC CTG ATC AAA ATG GAG 816 

He Asn Asn Pro Arg Asp Phe He Asp Cys Phe Leu He Lys Met Glu 
260 265 270 

AAG GAA AAG CAA AAC CAA CAG TCT GAA TTC ACT ATT GAA AAC TTG GTA ' 864 

Lys Glu Lys Gin Asn Gin Gin Ser Glu Phe Thr He Glu Asn Leu Val 
275 280 285 

ATC ACT GCA GCT GAC TTA CTT GGA GCT GGG ACA GAG ACA ACA AGC ACA 912 

He Thr Ala Ala Asp Leu Leu Gly Ala Gly Thr Glu Thr Thr Ser Thr 

290 295 300 

ACC CTG AGA TAT GCT CTC CTT CTC CTG CTG AAG CAC CCA GAG GTC ACA 96 0 

Thr Leu Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 

305 310 315 320 

GCT AAA GTC CAG GAA GAG ATT GAA CGT GTC GTT GGC AGA AAC CGG AGC 100 8 

Ala Lys Val Gin Glu Glu He Glu Arg Val Val Gly Arg Asn Arg Ser 
325 330 335 

CCC TGC ATG CAG GAC AGG GGC CAC ATG CCC TAC ACA GAT GCT GTG GTG 1056 

Pro Cys Met Gin Asp Arg Gly His Met Pro Tyr Thr Asp Ala Val Val 
340 345 * 350 
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CAC GAG GTC CAG AGA TAC ATC GAC CTC ATC CCC ACC AGC CTG CCC CAT 1104 
His Glu Val Gin Arg Tyr lie Asp Leu lie Pro Thr Ser Leu Pro His 
355 360 365 

GCA GTG ACC TGT GAC GTT AAA TTC AGA AAC TAC CTC ATT CCC AAG GGC 1152 
Ala Val Thr Cys Asp Val Lys Phe Arg Asn Tyr Leu lie Pro Lys Gly 
370 * 375 380 

ACA ACC ATA TTA ACT TCC CTC ACT TCT GTG CTA CAT GAC AAC AAA GAA 120 0 

Thr Thr lie Leu Thr Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu 
385 390 395 400 

TTC CCC AAC CCA GAG ATG TTT GAC CCT CGT CAC TTT CTG GAT GAA GGT 124 8 

Phe Pro Asn Pro Glu Met Phe Asp Pro Arg His Phe Leu Asp Glu Gly 
405 " 410 415 

GGA AAT TTT AAG AAA AGT AAC TAC TTC ATG CCT TTC TCA GCA GGA AAA 129 6 

'5 Gly Asn Phe Lys Lys Ser Asn Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 425 430 

CGG ATT TGT GTG GGA GAG GGC CTG GCC CGC ATG GAG CTG TTT TTA TTC # 1344 

Arg lie Cys Val =Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe 

435 4 440 445 

20 

CTG ACC TTC ATT TTA . CAG AAC TTT AAC CTG AAA TCT CTG ATT GAC CCA 13 92 

Leu Thr Phe lie Leu Gin Asn Phe Asn Leu Lys Ser Leu lie Asp Pro 

450 455 460 

AAG GAC CTT GAC ACA ACT CCT GTT GTC AAT GGA TTT GCT TCT GTC CCG 144 0 

25 Lys Asp Leu Aso Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro 
465 * 470 475 480 

CCC TTC TAT CAG CTG TGC TTC ATT CCT GTC TGA ' : \ 14 73 

Pro Phe Tyr Gin Leu Cys Phe lie Pro Val 
485 490 \ 



30 



35 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Asp Pro Phe Val Val Leu Val Leu Cys Leu Ser Cys Leu Leu Leu 
15 10 15 

Leu Ser Leu Trp Arg Gin Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly 
20 25 30 

Pro Thr Pro Leu Pro Val lie Gly Asn lie Leu Gin lie Asp lie Lys 
35 40 45 
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Asp Val Ser Lys Ser Leu Thr Asn Leu Ser Lys He Tyr Gly Pro Val 
50 55 60 

Phe Thr Leu Tyr Phe Gly Leu Glu Arg Met Val Val Leu His Gly Tyr 
65 70 75 80 

Glu Val Val Lys Glu Ala Leu He Asp Leu Gly Glu Glu Phe Ser Gly 
85 " 90 95 

10 Arg Gly His Phe Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly He 

100 105 110 

Val Phe Ser Asn Gly Lys Arg Trp Lys Glu He Arg Arg Phe Ser Leu 
115 120 125 

/5 Met Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser lie Glu Asp Arg 

130 135 140 

Val Gin Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 
145 ; 150 155 160 

20 Ala Ser Pro Cys Asp Pro Thr Phe lie Leu Gly Cys Ala Pro Cys Asn 

165 170 175 

Val lie Cys Ser He lie Phe Gin Lys Arg Phe Asp Tyr Lys Asp Gin 
180 185 190 



25 



30 



35 



40 



45 



50 



Gin Phe Leu Asn Leu Met Glu Lys Leu Asn Glu Asn lie Arg lie Val 
195 200 205 

Ser Thr Pro Trp He Gin lie Cys Asn Asn Phe Pro Thr He lie'- Asp 
210 215 220 

Tyr Phe Pro Gly Thr His Asn Lys Leu Leu Lys Asn Leu Ala Phe Met 
225 230 235 240 

Glu Ser Asp lie Leu Glu Lys Val Lys Glu His Gin Glu Ser Met Asp 
245 250 255 

lie Asn Asn Pro Arg Asp Phe lie Asp Cys Phe Leu lie Lys Met Glu 
• 260 265 270 

Lys Glu Lys Gin Asn Gin Gin Ser Glu Phe Thr He Glu Asn Leu Val 
275 280 285 

He Thr Ala Ala Asp Leu Leu Gly Ala Gly Thr Glu Thr Thr Ser Thr 
290 295 300 

Thr Leu Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 
305 310 315 320 

Ala Lys Val Gin Glu Glu lie Glu Arg Val Val Gly Arg Asn Arg Ser 
325 330 335 

Pro Cys Met Gin Asp Arg Gly His Met Pro Tyr Thr Asp Ala Val Val 
340 345 350 

His Glu Val Gin Arg Tvr lie Asp Leu lie Pro Thr Ser Leu Pro His 
355 ' 360 365 
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20 



25 



30 



Ala Val Thr Cys Asp Val Lys Phe Arg Asn Tyr Leu lie Pro Lys Gly 
370 375 380 

Thr Thr lie Leu Thr Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu 
385 390 395 400 

Phe Pro Asn Pro Glu Met Phe Asp Pro Arg His Phe Leu Asp Glu Gly 
405 410 415 

Gly Asn Phe Lys Lys Ser Asn Tyr Phe Met Pro Phe Ser Ala Gly Lys 
420 425 430 

Arg He Cys Val Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe 
435 440 445 

Leu Thr Phe He Leu Gin Asn Phe Asn Leu Lys Ser Leu He Asp Pro 
450 455 460 

Lys Asp Leu Asp Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro 
465 470 475 480 

Pro Phe Tyr Gin Leu Cys Phe He Pro Val 
485 490 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .1491 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATG GGG CTA GAA GCA CTG GTG CCC CTG GCC GTG ATA GTG GCC ATC TTC 4 8 

Met Gly Leu Glu Ala Leu Val Pro Leu Ala Val He Val Ala He Phe 

15 10 15 

CTG CTC CTG GTG GAC CTG ATG CAC CGG CGC CAA CGC TGG GCT GCA CGC 96 

Leu Leu Leu Val Asp Leu Met His Arg Arg Gin Arg Tro Ala Ala Arg 

20 25 * 30 

TAC CCA CCA GGC CCC CTG CCA CTG CCC GGG CTG GGC AAC CTG CTG CAT 14 4 

Tyr Pro Pro Gly Pro Leu Pro Leu Pro Gly Leu Gly Asn Leu Leu His 

3 5 4 0 4 5 

GTG GAC TTC CAG AAC ACA CCA TAC TGC TTC GAC CAG TTG CGG CGC CGC 192 

Val Asp Phe Gin Asn Thr Pro Tyr Cys Phe Asp Gin Leu Arg Arg Arg 

50 55 60 
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TTC GGG GAC GTG TTC AGC CTG CAG CTG GCC TGG ACG CCG GTG GTC GTG 240 
?he Gly Asp Val Phe Ser Leu Gin Leu Ala Trp Thr Pro Val Val Val 
65 70 75 80 

5 CTC AAT GGG CTG GCG GCC GTG CGC GAG GCG CTG GTG ACC CAC GGC GAG 288 
Leu Asn Gly Leu Ala Ala Val Arg Glu Ala Leu Val Thr His Gly Glu 
8$ 90 95 

GAC ACC GCC GAC CGC CCG CCT GTG CCC ATC ACC CAG ATC CTG GGT TTC 336 
Asp Thr Ala Asp Arg Pro Pro Val Pro lie Thr Gin lie Leu Gly Phe 
10 100 105 110 

'. GGG CCG CGT TCC CAA GGG GTG TTC CTG GCG CGC TAT GGG CCC GCG TGG 384 
Gly Pro Arg Ser Gin Gly Val Phe Leu Ala Arg Tyr Gly Pro Ala Trp 
115 120 125 

15 CGC GAG CAG AGG CGC TTC TCC GTC TCC ACC TTG CGC AAC TTG GGC CTG 432 
Arg Glu Gin Arg Arg Phe Ser Val Ser Thr Leu Arg Asn Leu Gly Leu 
130 135 140 

GGC AAG AAG TCG CTG GAG CAG TGG GTG ACC GAG GAG GCC GCC TGC CTT 480 
Gly Lys Lys Seb: Leu Glu Gin Trp Val Thr Glu Glu Ala Ala Cys Leu 
20 145 ' 150 155 160 

TGT GCC GCC TTC GCC AAC CAC TCC GGA CGC CCC TTT CGC CCC AAC GGT 528 
Cys Ala Ala Phe Ala Asn His Ser Gly Arg Pro Phe Arg Pro Asn Gly 
165 170 175 



25 



30 



35 



40 



45 



CTC TTG GAC AAA GCC GTG AGC AAC GTG ATC GCC TCC CTC ACC TGC GGG 576 

Leu Leu Asp Lys Ala Val Ser Asn Val lie Ala Ser Leu Thr Cys Gly 

180 185 190 

CGC CGC TTC GAA TAC GAC GAC CCT CGC TTC CTC AGG CTG CTG GAC CTA 624 

Arg Arg Phe Glu Tyr Asp Asp Pro Arg Phe Leu Arg Leu Leu Asp Leu 
195 200 1 205 

GCT CAG GAG GGA CTG AAG GAG GAG TCG GGC TTT CTG CGC GAG GTG CTG 672 

Ala Gin Glu Gly Leu Lys Glu Glu Ser Gly Phe Leu Arg Glu Val Leu 

210 215 220 

AAT GCT GTC CCC GTC CTC CTG CAT ATC CCA GCG CTG GCT GGC AAG GTC 720 

Asn Ala Val Pro Val Leu Leu His lie Pro Ala Leu Ala Gly Lys Val 

225 230 235 240 

CTA CGC TTC CAA AAG GCT TTC CTG ACC CAG CTG GAT GAG CTG CTA ACT 768 

Leu Arg Phe Gin Lys Ala Phe Leu Thr Gin Leu Asp Glu Leu Leu Thr 

245 250 255 

GAG CAC AGG ATG ACC TGG GAC CCA GCC CAG CCC CCC CGA GAC CTG ACT 816 

Glu His Arg Met Thr Trp Asp Pro Ala Gin Pro Pro Arg Asp Leu Thr 

260 265 270 

GAG GCC TTC CTG GCA GAG ATG GAG AAG GCC AAG GGG AAC CCT GAG AGC 8 64 

Glu Ala Phe Leu Ala Glu Met Glu Lys Ala Lys Gly Asn Pro Glu Ser 
275 280 285 

AGC TTC AAT GAT GAG AAC CTG TGC ATA GTG GTG GCT GAC CTG TTC TCT 912 

Ser Phe Asn Asp Glu Asn Leu Cys He Val Val Ala Asp Leu Phe Ser 

290 295 300 



55 



83 



EP 0 644 267 A2 



GCC GGG ATG GTG ACC ACC TCG ACC ACG CTG GCC TGG GGC CTC CTG CTC 960 

Ala Gly Met Val Thr Thr Ser Thr Thr Leu Ala Trp Gly Leu Leu Leu 
305 310 315 320 

5 ATG ATC CTA CAT CCG GAT GTG CAG CGC CGT GTC CAA CAG GAG ATC GAC 1008 

Met He Leu His Pro Asp Val Gin Arg Arg Val Gin Gin Glu He Asp 

325 330 335 

GAC GTG ATA GGG CAG GTG CGG CGA CCA GAG ATG GGT GAC CAG GCT CAC 1056 

Aso Val He Gly Gin Val Arg Arg Pro Glu Met Gly Asp Gin Ala His 
70 " 340 345 350 

ATG CCC TAC ACC ACT GCC GTG ATT CAT GAG GTG CAG CGC TTT GGG GAC 1104 

Met Pro Tyr Thr Thr Ala Val He His Glu Val Gin Arg Phe Gly Asp 

355 360 365 

/5 ATC GTC CCC CTG GGT GTG ACC CAT ATG ACA TCC CGT GAC ATC GAA GTA 1152 

He Val Pro Leu Gly Val Thr His Met Thr Ser Arg Asp He Glu Val 

370 375 380 

CAG GGC TTC CGC ATC CCT AAG GGA ACG ACA CTC ATC ACC AAC CTG TCA -1200 

Gin Gly Phe Arg lie Pro Lys Gly Thr Thr Leu He Thr Asn Leu Ser 
385 390 395 400 



20 



25 



30 



35 



TCG GTG CTG AAG GAT GAG GCC GTC TGG GAG AAG CCC TTC CGC TTC CAC 1248 
Ser Val Leu Lys Asp Glu Ala Val Trp Glu Lys Pro Phe Arg Phe His 
405 410 415 

CCC GAA CAC TTC CTG GAT GCC CAG GGC CAC TTT GTG AAG CCG GAG GCC 1296 
Pro Glu His Phe Leu Asp Ala Gin Gly His Phe Val Lys Pro Glu Ala 
420 * 425 430 

TTC CTG CCT TTC TCA GCA GGC CGC CGT GCA TGC CTC GGG GAG CCC CTG 1344 
Phe Leu Pro Phe Ser Ala Gly Arg Arg Ala >Cys Leu Gly Glu Pro Leu 
435 440 ' 445 

GCC CGC ATG GAG CTC TTC CTC TTC TTC ACC TCC CTG CTG CAG CAC TTC 13 92 

Ala Arg Met Glu Leu Phe Leu Phe Phe Thr Ser Leu Leu Gin His Phe 
450 455 460 

AGC TTC TCG GTG CCC ACT GGA CAG CCC CGG CCC AGC CAC CAT GGT GTC 144 0 

Ser Phe Ser Val Pro Thr Gly Gin Pro Arg Pro Ser His His Gly Val 
465 470 475 480 

TTT GCT TTC CTG GTG ACC CCA TCC CCC TAT GAG CTT TGT GCT GTG CCC 1488 
Phe Ala Phe Leu Val Thr Pro Ser Pro Tyr Glu Leu Cys Ala Val Pro 
485 490 495 

CGC TAG 14 94 

Arq 



45 (2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 497 amino acids 
(3) TYPE: amino acid 

50 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Leu Glu Ala Leu Val Pro Leu Ala Val lie Val Ala He Phe 
1 5 io 15 

Leu Leu Leu Val Asp Leu Met His Arg Arg Gin Arg Trp Ala Ala Arg 
20 25 30 

Tyr Pro Pro Gly Pro Leu Pro Leu Pro Gly Leu Gly Asn Leu Leu His 
35 40 45 

Val Asp Phe Gin Asn Thr Pro Tyr Cys Phe Asd Gin Leu Arg Arq Arq 
50 55 "60 

Phe Gly Asp Val Phe Ser Leu Gin Leu Ala Trp Thr Pro Val Val Val 
65 70 75 - 80 

Leu Asn Gly Leu Ala Ala Val Arg Glu Ala Leu Val Thr His Gly Glu 
85 90 95 

Asp Thr Ala Asp Arg Pro Pro Val Pro He Thr Gin He Leu Gly Phe 
100 105 110 

Gly Pro Arg Ser Gin Gly Val Phe Leu Ala Arg Tyr Gly Pro Ala Trp 
115 120 125 

Arg Glu Gin Arg Arg Phe Ser Val Ser Thr Leu Arg Asn Leu Glyi-Leu 
130 135 140 \ 

Gly Lys Lys Ser Leu Glu Gin Trp Val Thr Glu Glu Ala Ala Cys Leu 
14 5 150 155 160 

Cys Ala Ala Phe Ala Asn His Ser Gly Arg Pro Phe Arg Pro Asn Gly 
165 170 175 

Leu Leu Asp Lys Ala Val Ser Asn Val He Ala Ser Leu Thr Cys Gly 
180 185 190 

Arg Arg Phe Glu Tyr Asp Asd Pro Arg Phe Leu Arg Leu Leu Asp Leu 
195 200 205 

Ala Gin Glu Gly Leu Lys Glu Glu Ser Gly Phe Leu Arg Glu Val Leu 
210 215 220 

Asn Ala Val Pro Val Leu Leu His He Pro Ala Leu Ala Gly Lvs Val 
225 230 235 - 240 

Leu Arg Phe Gin Lys Ala Phe Leu Thr Gin Leu Asp Glu Leu Leu Thr 
245 250 " 255 

Glu His Arg Met Thr Trp Asp Pro Ala Gin Pro Pro Arg Aso Leu Thr 
260 265 270 

Glu Ala Phe Leu Ala Glu Met Glu Lys Ala Lys Gly Asn Pro Glu Ser 
275 280 . 285 
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25 
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35 



40 



45 



Ser Phe Asn Asp Glu Asn Leu Cys He Val Val Ala Asp Leu ?he Ser 
290 295 300 

Ala Gly Met Val Thr Thr Ser Thr Thr Leu Ala Trp Gly Leu Leu Leu 
305 ; 310 315 320 

Met He Leu His Pro Asp Val Gin Arg Arg Val Gin Gin Glu He Asp 
325 330 335 

Asp Val He Gly Gin Val Arg Arg Pro Glu Met Gly Asd Gin Ala His 
340 345 * 350 

Met Pro Tyr Thr Thr Ala Val He His Glu Val Gin Arg Phe Gly Asp 
355 360 365 

He Val Pro Leu Gly Val Thr His Met Thr Ser Arg Asp lie Glu Val 
370 375 380 

Gin Gly Phe Ar?g lie Pro Lys Gly Thr Thr Leu He Thr Asn Leu Ser 
385 ' 390 395 400 

Ser Val Leu Lys Asp Glu Ala Val Trp Glu Lys Pro Phe Arg Phe His 
405 410 415 

Pro Glu His Phe Leu Asp Ala Gin Gly His Phe Val Lys Pro Glu Ala 
420 . 425 430 

Phe Leu Pro Phe Ser Ala Gly Arg Arg Ala Cys Leu Gly Glu Pro Leu 
435 440 445 

Ala Arg Met Glu Leu Phe Leu Phe Phe Thr Ser Leu Leu Gin His Phe 
450 455 460 

Ser Phe Ser Val Pro Thr Gly Gin Pro Arg Pro Ser His His Gly Val 
465 470 475 480 

Phe Ala Phe Leu Val Thr Pro Ser Pro Tyr Glu Leu Cys Ala Val Pro 
485 490 495 

Arg 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acicl 

(C) STRANDEDNSSS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 
50 (A) NAME/KEY: CDS 

(B) LOCATION: 1 . .1491 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATG GGG CTA GAA GCA CTG GTG CCC CTG GCC GTG ATA GTG GCC ATC TTC 4 3 

Met Gly Leu Glu Ala Leu Val Pro Leu Ala Val He Val Ala He Phe 
5 1 5, 10 15 

CTG CTC CTG GTG GAC CTG ATG CAC CGG CGC CAA CGC TGG GCT GCA CGC 95 

Leu Leu Leu Val Asp Leu Met His Arg Arg Gin Arg Tro Ala Ala Arg 
20 25 " 30 

10 TAC CCA CCA GGC CCC CTG CCA CTG CCC GGG CTG GGC AAC CTG CTG CAT 14 4 

.Tyr Pro Pro Gly Pro Leu Pro Leu Pro Gly Leu Gly Asn Leu Leu His 

35 40 45 

GTG GAC TTC CAG AAC ACA CCA TAC TGC TTC GAC CAG TTG CGG CGC CGC 192 

Val Asp Phe Gin Asn Thr Pro Tyr Cys Phe Asp Gin Leu Arg Arg Arg 
75 5 0 5 5 6 0 

TTC GGG GAC GTG TTC AGC CTG CAG CTG GCC TGG ACG CCG GTG GTC GTG 24 0 

Phe Gly Asp Val Phe Ser Leu Gin Leu Ala Trp Thr Pro Val Val Val 
65 70 75 80 



20 



25 



30 



35 



45 



CTC AAT GGG CTG GCG GCC GTG CGC GAG GCG CTG GTG ACC CAC GGC GAG 2 88 

"eu Asn Gly Leu Ala Ala Val Arg Glu Ala Leu Val Thr His Gly Glu 
85 90 95 

GAC ACC GCC GAC CGC CCG CCT GTG CCC ATC ACC CAG ATC CTG GGT TTC 336 
Asp Thr Ala Asp Arg Pro Pro Val Pro He Thr Gin He Leu Gly Phe 
100 105 110 

GGG CCG CGT TCC CAA GGG GTG TTC CTG GCG CGC TAT GGG CCC GCG TGG 3 84 

Gly Pro Arg Ser Gin Gly Val Phe Leu Ala Arg Tyr Gly Pro Ala TrD 

115 120 125 * " ! 

CGC GAG CAG AGG CGC TTC TCC GTC TCC ACC TTG CGC AAC TTG GGC CTG 432 
Arg Glu Gin Arg Arg Phe Ser Val Ser Thr Leu Arg Asn Leu Gly Leu 
130 135 140 

GGC AAG AAG TCG CTG GAG CAG TGG GTG ACC GAG GAG GCC GCC TGC CTT 480 
Gly Lys Lys Ser Leu Glu Gin Trp Val Thr Glu Glu Ala Ala Cys Leu 
145 150 155 160 

TGT GCC GCC TTC GCC AAC CAC TCC GGA CGC CCC TTT CGC CCC AAC GGT 52 8 

Cys Ala Ala Phe Ala Asn His Ser Gly Arg Pro Phe Arg Pro Asn Gly 

165 170 175 " * 

CTC TTG GAC AAA GCC GTG AGC AAC GTG ATC GCC TCC CTC ACC TGC GGG 576 
Leu Leu Asp Lys Ala Val Ser Asn Val He Ala Ser Leu Thr Cys Gly 
180 185 190 

CGC CGC TTC GAA TAC GAC GAC CCT CGC TTC CTC AGG CTG CTG GAC CTA - 6 24 

Arg Arg Phe Glu Tyr Asp Asp Pro Arg Phe Leu Arg Leu Leu Asp Leu 
195 200 205 

GCT CAG GAG GGA CTG AAG GAG GAG TCG GGC TTT CTG CGC GAG GTG CTG 6 72 

Ala Gin Glu Gly Leu Lys Glu Glu Ser Gly Phe Leu Arg Glu Val Leu 
210 215 220 
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70 



AAT GCT GTC 
Asn Ala Val 
225 

CTA CGC TTC 
Leu Arg Phe 



GAG CAC AGG 
Glu His Arg 



CCC GTC CTC CTG CAT 
Pro Val Leu Leu His 
230 

CAA AAG GCT TTC CTG 
Gin Lys Ala Phe Leu 
24 5 \ 



GAG 
Glu 



GCC TTC 
Ala Phe 
275 



ATG 
Met 
260 

CTG 
Leu 



ACC TGG GAC CCA 
Thr Trp Asp Pro 



GCA GAG 
Ala Glu 



ATG GAG 
Met Glu 
280 



ATC CCA GCG CTG GCT GGC AAG GTC 
lie Pro Ala Leu Ala Gly Lys Val 
235 240 

ACC CAG CTG GAT GAG CTG CTA ACT 
Thr Gin Leu Asp Glu Leu Leu Thr 
250 255 

GCC CAG CCC CCC CGA GAC CTG ACT 
Ala Gin Pro Pro Arg Asp Leu Thr 
265 270 

AAG GCC AAG GGG AAC CCT GAG AGC 
Lys Ala Lys Gly Asn Pro Glu Ser 
285 



720 



768 



816 



664 



75 



AGC 
Ser 



TTC AAT 
Phe Asn 
290 



GAT 
Asp 



GAG AAC 
Glu Asn 



CTG CGC 
Leu Arg 
295 



ATA GTG 
He Val 



GTG GCT 
Val Ala 
300 



GAC CTG TTC TCT 
Asp Leu Phe Ser 



912 



20 



25 



30 



35 



GCC 
Ala 

"05 

ATG 
Met 



GGG ATG 
Gly Met 



ATC CTA 
He Leu 



GAC GTG ATA 
Asd Val He 



ATG CCC TAC 
Met Pro Tyr 
355 

ATC GTC CCC 
He Val Pro 
370 

CAG GGC TTC 
Gin Gly Phe 
385 

TCG GTG CTG 
Ser Val Leu 



CCC GAA CAC 
Pro Glu His 



GTG 
Val 



CAT 
His 



GGG 
Gly 
340 

ACC 
Thr 



CTG 
Leu 



CGC 
Arg 

AAG 
Lys 



TTC 
Phe 
420 



ACC ACC 
Thr Thr 
"* 310 

CCG GAT 
Pro Asp 
325 

CAG GTG 
Gin Val 



ACT GCC 
Thr Ala 



GGT GTG 
Gly Val 



ATC CCT 
He Pro 
390 

GAT GAG 
Asp Glu 
405 

CTG GAT 
Leu Asp 



TCG ACC 
Ser Thr 



GTG CAG 
Val Gin 



CGG CGA 
Arg Arg 



GTG ATT 
Val He 
360 

ACC CAT 
Thr His 
375 

AAG GGA 
Lys Gly 



GCC GTC 
Ala Val 



GCC CAG 
Ala Gin 



ACG CTG 
Thr Leu 



CGC CGT 
Arg Arg 
330 

CCA GAG 
Pro Glu 
345 

CAT GAG 
His Glu 



ATG 
Met 



ACG 
Thr 



TGG 
Trp 



GGC 
Gly 
425 



ACA 
Thr 



ACA 
Thr 



GAG 
Glu 
410 

CAC 
His 



GCC TGG 
Ala Trp 
315 

GTC CAA 
Val Gin 



ATG GGT 
Met Gly 



GTG CAG 
Val Gin 



TCC CGT 
Ser Arg 
380 

CTC ATC 
Leu He 
395 

AAG CCC 
Lys Pro 



TTT GTG 
Phe Val 



GGC CTC 
Gly Leu 



CAG GAG 
Gin Glu 



GAC CAG 
Asd Gin 
350 

CGC TTT 
Arg Phe 
365 

GAC ATC 
Asp He 



ACC AAC 
Thr Asn 



TTC CGC 
Phe Arg 



AAG CCG 
Lys Pro 
430 



CTG CTC 
Leu Leu 
320 

ATC GAC 
lie Asd 
335 

GCT CAC 
Ala His 



GGG GAC 
Gly Asp 



GAA GTA 
Glu Val 



CTG TCA 
Leu Ser 
400 

TTC CAC 
Phe His 
415 

GAG GCC 
Glu Ala 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



45 



TTC CTG CCT TTC TCA GCA GGC CGC CGT GCA TGC CTC GGG GAG CCC CTG 
Phe Leu Pro Phe Ser Ala Gly Arg Arg Ala Cys Leu Gly Glu Pro Leu 
435 440 445 



1344 



GCC CGC ATG 
Ala Arg Met 
450 



GAG CTC TTC CTC TTC 
Glu Leu Phe Leu Phe 
455 



TTC ACC TCC CTG CTG CAG CAC TTC 
Phe Thr Ser Leu Leu Gin His Phe 
460 



1392 



50 



88 
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AGC TTC TCG GTG CCC ACT GGA CAG CCC CGG CCC AGC CAC CAT GGT GTC 144 0 

Ser Phe Ser Val Pro Thr Gly Gin Pro Arg Pro Ser His His Gly Val 
465 470 475 480 

TTT GCT TTC CTG GTG ACC CCA TCC CCC TAT GAG CTT TGT GCT GTG CCC 14 68 

Phe Ala Phe Leu Val*, Thr Pro Ser Pro Tyr Glu Leu Cys Ala Val Pro 
485 490 495 

CGC TAG 14 94 

Arg 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 497 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Gly Leu Glu Ala Leu Val Pro Leu Ala Val He Val Ala He Phe 
15 10 15 

Leu Leu Leu Val Asp Leu Met His Arg Arg Gin Arg Trp Ala Ala Arg 
25 2 0 2 5 3 0 

Tyr Pro Pro Gly Pro Leu Pro Leu Pro Gly Leu Gly Asn Leu Leu His 
35 40 45 

Val Asp Phe Gin Asn Thr Pro Tyr Cys Phe Asd Gin Leu Arg Arg Arg 
30 50 55 * 60 

Phe Gly Asp Val Phe Ser Leu Gin Leu Ala Trp Thr Pro Val Val Val 
65 70 75 80 

Leu Asn Gly Leu Ala Ala Val Arg Glu Ala Leu Val Thr His Gly Glu 
35 85 90 95 

Asp Thr Ala Asp Arg Pro Pro Val Pro He Thr Gin lie Leu Gly Phe 
100 105 110 

Gly Pro Arg Ser Gin Gly Val Phe Leu Ala Arg Tyr Gly Pro Ala Trp 
w 115 120 125 

Arg Glu Gin Arg Arg Phe Ser Val Ser Thr Leu Arg Asn Leu Gly Leu 
130 135 140 

Gly Lys Lys Ser Leu Glu Gin Trp Val Thr Glu Glu Ala Ala Cys Leu 
45 145 150 155 160 

Cys Ala Ala Phe Ala Asn His Ser Gly Arg Pro Phe Arg Pro Asn Gly 
165 170 175 



50 



55 



89 
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w 



Leu Leu Asp Lys Ala Val Ser Asn Val He Ala Ser Leu Thr Cys Gly 
180 185 190 

Arg Arg Phe Glu Tyr Asp Asp Pro Arg Phe Leu Arg Leu Leu Asp Leu 
195 200 205 

Ala Gin Glu Gly Leu Lys Glu Glu Ser Gly Phe Leu Arg Glu Val Leu 
210 215 220 

Asn Ala Val Pro Val Leu Leu His He Pro Ala Leu Ala Gly Lys Val 
225 230 235 240 

Leu Arg Phe Gin Lys Ala Phe Leu Thr Gin Leu Aso Glu Leu Leu Thr 
245 250 * 255 



15 



Glu His Arg Met Thr Trp Asp Pro Ala Gin Pro Pro Arg Asp Leu Thr 
260 265 270 



Glu Ala Phe Leu Ala Glu Met Glu Lys Ala Lys Gly Asn Pro Glu Ser 

275 280 285 

20 Ser Phe Asn Asp Glu Asn Leu Arg He Val Val Ala Asp Leu Phe Ser 

290 295 300 

Ala Gly Met Val Thr Thr Ser Thr Thr Leu Ala Trp Gly Leu Leu Leu 

305 310 315 320 



25 



30 



35 



Met He Leu His Pro Asp Val Gin Arg Arg Val Gin Gin Glu lie Asp 
325 330 335 

Asp Val He Gly Gin Val Arg Arg Pro Glu Met Gly Asp Gin Ala' : His 
340 345 350 

Met Pro Tyr Thr Thr Ala Val He His Glu Val Gin Arg Phe Gly Asp 
355 360 365 

lie Val Pro Leu Gly Val Thr His Met Thr Ser Arg Asp lie Glu Val 
370 375 380 

Gin Gly Phe Arg lie Pro Lys Gly Thr Thr Leu He Thr Asn Leu Ser 
385 390 395 400 



40 



Ser Val Leu Lys Asp Glu Ala Val Trp Glu Lys Pro Phe Arg Phe His 
405 410 ' 415 

Pro Glu His Phe Leu Asp Ala Gin Gly His Phe Val Lys Pro Glu Ala 
420 425 430 

Phe Leu Pro Phe Ser Ala Gly Arg Arg Ala Cys Leu Gly Glu Pro Leu 
435 440 445 

Ala Arg Met Glu Leu Phe Leu Phe Phe Thr Ser Leu Leu Gin His Phe 
450 455 460 



Ser Phe Ser Val Pro Thr Gly Gin Pro Arg Pro Ser His His Gly Val 
465 470 475 480 



55 



90 
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Phe Ala Phe Leu Val Thr Pro Ser Pro Tyr Glu Leu Cys Ala Val Pro 
485 490 495 

Arg 



(2) INFORMATION FOR, SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 94 base pairs 

(B) TYPE: nucleic acid 

10 {C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<ix) FEATURE: 

(A) NAME/KEY: CDS 
75 (B) LOCATION: 1. .14 91 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATG GGG CTA GA& GCA CTG GTG CCC CTG GCC GTG ATA GTG GCC ATC TTC 4 8 

20 Met Gly Leu Glu Ala Leu Val Pro Leu Ala Val lie Val Ala He Phe 

15 10 15 

CTG CTC CTG GTG GAC CTG ATG CAC CGG CGC CAA CGC TGG GCT GCA CGC 9 6 

Leu Leu Leu Val Asu Leu Met His Arg Arg Gin Arg Trp Ala Ala Arg 

20 " 25 30 

25 TAC CCA CCA GGC CCC CTG CCA CTG CCC GGG CTG GGC AAC CTG CTG CAT 144 

Tyr Pro Pro Gly Pro Leu Pro Leu Pro Gly Leu Gly Asn Leu Leu His 

35 40 45 

GTG GAC TTC CAG AAC ACA CCA TAC TGC TTC GAC CAG TTG CGG CGC CGC '192 

Val Asp Phe Gin Asn Thr Pro Tyr Cys Phe Asp Gin Leu Arg Arg Arg 

50 55 " 60 

TTC GGG GAC GTG TTC AGC CTG CAG CTG GCC TGG ACG CCG GTG GTC GTG 24 0 

Phe Gly Asp Val Phe Ser Leu Gin Leu Ala Trp Thr Pro Val Val Val 

65 70 75 80 

35 CTC AAT GGG CTG GCG GCC GTG CGC GAG GCG CTG GTG ACC CAC GGC GAG 288 

Leu Asn Gly Leu Ala Ala Val Arg Glu Ala Leu Val Thr His Gly Glu 

85 90 95 

GAC ACC GCC GAC CGC CCG CCT GTG CCC ATC ACC CAG ATC CTG GGT TTC 33 6 

Asp Thr Ala Asp Arg Pro Pro Val Pro He Thr Gin He Leu Gly Phe 

40 100 105 110 

GGG CCG CGT TCC CAA GGG GTG TTC CTG GCG CGC TAT GGG CCC GCG TGG 3 84 

Gly Pro Arg Ser Gin Gly Val Phe Leu Ala Arg Tyr Gly Pro Ala Trp 

115 120 125 

45 CGC GAG CAG AGG CGC TTC TCC GTC TCC ACC TTG CGC AAC TTG GGC CTG 43 2 

Arg Glu Gin Arg Arg Phe Ser Val Ser Thr Leu Arg Asn Leu Gly Leu 

130 135 140 



SO 



55 



91 
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GGC AAG AAG TCG CTG GAG CAG TGG GTG ACC GAG GAG GCC GCC TGC CTT 4 80 

Giy Lys Lys Ser Leu Glu Gin Trp Val Thr Glu Glu Ala Ala Cys Leu 
145 150 155 160 

5 TGT GCC GCC TTC GCC AAC CAC TCC GGA CGC CCC TTT CGC CCC AAC GGT 528 
Cys Ala Ala Phe Ala Asn His Ser Gly Arg Pro Phe Arg Pro Asn Gly 
165 4 170 175 

CTC TTG GAC AAA GCC GTG AGC AAC GTG ATC GCC TCC CTC ACC TGC GGG 576 
Leu Leu Asp Lys Ala Val Ser Asn Val lie Ala Ser Leu Thr Cys Gly 
70 180 185 190 

CGC CGC TTC GAA TAC GAC GAC CCT CGC TTC CTC AGG CTG CTG GAC CTA 624 
Arg Arg Phe Glu Tyr Asp Asp Pro Arg Phe Leu Arg Leu Leu Asp Leu 
195 200 205 



75 



20 



25 



GCT CAG GAG GGA CTG AAG GAG GAG TCG GGC TTT CTG CGC GAG GTG CTG 672 
Ala Gin Glu Gly Leu Lys Glu Glu Ser Gly Phe Leu Arg Glu Val Leu 
210 215 220 

AAT GCT GTC CCC GTC CTC CTG CAT ATC CCA GCG CTG GCT GGC AAG GTC 720 
Asn Ala Val Pro Val Leu Leu His He Pro Ala Leu Ala Gly Lys Val 
225 1 230 235 240 

CTA CGC TTC CAA AAG GCT TTC CTG ACC CAG CTG GAT GAG CTG CTA ACT 768 
Leu Arg Phe Gin Lys Ala Phe Leu Thr Gin Leu Asn Glu Leu Leu Thr 
245 250 * 255 

GAG CAC AGG ATG ACC TGG GAC CCA GCC CAG CCC CCC CGA GAC CTG ACT 816 
Glu His Arg Met Thr Trp Asp Pro Ala Gin Pro Pro Arg Aso Leu Thr 
260 265 270 

GAG GCC TTC CTG GCA GAG ATG GAG AAG GCC AAG GGG AAC CCT GAG AGC <\ 864 
Glu Ala Phe Leu Ala Glu Met Glu Lys Ala Lys Gly Asn Pro Glu Ser 
275 280 \ 285 

AGC TTC AAT GAT GAG AAC CTG CGC ATA GTG GTG GCT GAC CTG TTC TCT 912 
Ser Phe Asn Asp Glu Asn Leu Arg lie Val Val Ala Asp Leu Phe Ser 
290 295 300 

GCC GGG ATG GTG ACC ACC TCG ACC ACG CTG GCC TGG GGC CTC CTG CTC 9 60 

35 Ala Gly Met Val Thr Thr Ser Thr Thr Leu Ala Trp Gly Leu Leu Leu 
305 ' 310 315 320 

ATG ATC CTA CAT CCG GAT GTG CAG CGC CGT GTC CAA CAG GAG ATC GAC 1008 

Met He Leu His Pro Asp Val Gin Arg Arg Val Gin Gin Glu He Asp 

325 330 335 

40 

GAC GTG ATA GGG CAG GTG CGG CGA CCA GAG ATG GGT GAC CAG GCT CAC 105 6 

Asp Val He Gly Gin Val Arg Arg Pro Glu Met Gly Asp Gin Ala His 

340 345 350 



30 



45 



ATG CCC TAC ACC ACT GCC GTG ATT CAT GAG GTG CAG CGC TTT GGG GAC 1104 
Met Pro Tyr Thr Thr Ala Val He His Glu Val Gin Arg Phe Gly Asp 
355 360 365 

ATC GTC CCC CTG GGT GTG ACC CAT ATG ACA TCC CGT GAC ATC GAA GTA 1152 
He Val Pro Leu Gly Val Thr His Met Thr Ser Arg Asp He Glu Val 
370 375 380 



50 



55 



92 
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w 



15 



20 



25 



CAG GGC TTC CGC ATC CCT AAG GGA ACG ACA CTC ATC ACC AAC CTG TCA 1200 
Gin Gly Phe Arg He Pro Lys Giy Thr Thr Leu He Thr Asn Leu Ser 
385 390 395 400 

TCG GTG CTG AAG GAT GAG GCC GTC TGG GAG AAG CCC TTC CGC TTC CAC 124 8 

Ser Val Leu Lys Asp Glu Ala Val Trp Glu Lys Pro Phe Arg Phe His 
40*5 410 " 415 

CCC GAA CAC TTC CTG GAT GCC CAG GGC CAC TTT GTG AAG CCG GAG GCC 1296 
Pro Glu His Phe Leu Asp Ala Gin Gly His Phe Val Lys Pro Glu Ala 
420 425 430 

TTC CTG CCT TTC TCA GCA GGC CGC CGT GCA TGC CTC GGG GAG CCC CTG 1344 
Phe Leu Pro Phe Ser Ala Gly Arg Arg Ala Cys Leu Gly Glu Pro Leu 
435 440 445 

GCC CGC ATG GAG CTC TTC CTC TTC TTC ACC TCC CTG CTG CAG CAC TTC 13 92 

Ala Arg Met Glu Leu Phe Leu Phe Phe Thr Ser Leu Leu Gin His Phe 
450 455 460 

AGC TTC TCG GTG CCC ACT GGA CAG CCC CGG CCC AGC CAC CAT GGT GTC 144 0 

Ser Phe Ser Val Pro Thr Gly Gin Pro Arg Pro Ser His His Gly Val 
465 1 470 475 480 

TTT GCT TTC CTG GTG AGC CCA TCC CCC TAT GAG CTT TGT GCT GTG CCC 148 8 

Phe Ala Phe Leu Val Ser Pro Ser Pro Tyr Glu Leu Cys Ala Val Pro 
485 490 495 

CGC TAG 14 94 

Arg 



35 



(2) INFORMATION FOR SEQ ID NO: 36: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 97 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Gly Leu Glu Ala Leu Val Pro Leu Ala Val He Val Ala He Phe 
15 10 15 

40 Leu Leu Leu Val Aso Leu Met His Arg Arg Gin Arg Trp Ala Ala Arg 
20 * 25 30 

Tyr Pro Pro Gly Pro Leu Pro Leu Pro Gly Leu Gly Asn Leu Leu His 
35 40 45 

45 Val Asp Phe Gin Asn Thr Pro Tyr Cys Phe Asp Gin Leu Arg Arg Arg 
50 55 60 

Phe Gly Asp Val Phe Ser Leu Gin Leu Ala Trp Thr Pro Val Val Val 
65 ' 70 75 80 

50 



55 



93 
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Leu Asn Gly Leu Ala Ala Val Arg Glu Ala Leu Val Thr His Gly Glu 
85 SO 95 

Asp Thr Ala Asp Arg Pro Pro Val Pro He Thr Gin He Leu Gly Phe 
100 \ 105 110 

Gly Pro Arg Ser Gin Gly Val Phe Leu Ala Arg Tyr Gly Pro Ala Trp 
115 120 125 



70 



Arg Glu Gin Arg Arg Phe Ser Val Ser Thr Leu Arg Asn Leu Gly Leu 
130 135 140 



75 



Gly Lys Lys Ser Leu Glu Gin Trp Val Thr Glu Glu Ala Ala Cys Leu 

145 150 155 160 

Cys Ala Ala Phe Ala Asn His Ser Gly Arg Pro Phe Arg Pro Asn Gly 

165 170 175 



20 



25 



Leu Leu Asp Lys Ala Val Ser Asn Val He Ala Ser Leu Thr Cys -Gly 
iSO 185 190 

Arg Arg Phe Glu Tyr Asp Asp Pro Arg Phe Leu Arg Leu Leu Asp Leu 
195 200 205 

Ala Gin Glu Gly Leu Lys Glu Glu Ser Gly Phe Leu Arg Glu Val Leu 
210 215 220 

Asn Ala Val Pro Val Leu Leu His He Pro Ala Leu Ala Gly Lys Val 
225 230 235 ,-.24 0 



30 



Leu Arg Phe Gin Lys Ala Phe Leu Thr Gin Leu Asp Glu Leu Leu Thr 
245 ' 250 255 



Glu His Arc Met Thr Trp Asp Pro Ala Gin Pro Pro Arg Asp Leu Thr 
260 265 270 



35 



Glu Ala Phe Leu Ala Glu Met Glu Lys Ala Lys Gly Asn Pro Glu Ser 
275 280 285 



Ser Phe Asn Asp Glu Asn Leu Arg He Val Val Ala Asp Leu Phe Ser 

290 295 300 

40 Ala Gly Met Val Thr Thr Ser Thr Thr Leu Ala Trp Gly Leu Leu Leu 

305 310 315 320 

Met lie Leu His Pro Asp Val Gin Arg Arg Val Gin Gin Glu He Asp 

325 " 330 335 



45 



Asp Val He Gly Gin Val Arg Arg Pro Glu Met Gly Asp Gin Ala His 
340 345 350 



50 



Met Pro Tyr Thr Thr Ala Val He His Glu Val Gin Arg Phe Gly Asp 

355 360 365 

He Val Fro Leu Gly Val Thr His Met Thr Ser Arg Asp He Glu Val 

370 375 380 



55 



94 
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Gin Gly ?he Arg He Pro Lys Gly Thr Thr Leu He Thr Asn Leu Ser 
385 390 395 400 

Ser Val Leu Lys Asp Glu Ala Val Trp Glu Lys Pro Phe Arg Phe His 
5 405 410 415 

Pro Glu His Phe Leu Asp Ala Gin Gly Kis Phe Val Lys Pro Glu Ala 
420 425 430 

Phe Leu Pro Phe Ser Ala Gly Arg Arg Ala Cys Leu Gly Glu Pro Leu 
10 435 440 445 

'.Ala Arg Met Glu Leu Phe Leu Phe Phe Thr Ser Leu Leu Gin His Phe 
450 455 460 

Ser Phe Ser Val Pro Thr Gly Gin Pro Arg Pro Ser His His Gly Val 
75 465 470 475 480 

Phe Ala Phe Leu Val Ser Pro Ser Pro Tyr Glu Leu Cys Ala Val Pro 
485 490 495 



20 



25 



30 



Arg \ 

(2) INFORMATION FOR SEQ ID NO: 37: 

{i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . .14 91 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

ATG GGG CTA GAA GCA CTG GTG CCC CTG GCC GTG ATA GTG GCC ATC TTC 4 8 

Met Gly Leu Glu Ala Leu Val Pro Leu Ala Val He Val Ala He Phe 
15 10 15 

CTG CTC CTG GTG GAC CTG ATG CAC CGG CGC CAA CGC TGG GCT GCA CGC 9 6 

Leu Leu Leu Val Asp Leu Met His Arg Arg Gin Arg Trp Ala Ala Arg 
20 25 30 

TAC CCA CCA GGC CCC CTG CCA CTG CCC GGG CTG GGC AAC CTG CTG CAT 14 4 

Tyr Pro Pro Gly Pro Leu Pro Leu Pro Gly Leu Gly Asn Leu Leu His 
3 5 4 0 4 5 

GTG GAC TTC CAG AAC ACA CCA TAC TGC TTC GAC CAG TTG CGG "tGC CGC 192 
45 Val Asp Phe Gin Asn Thr Pro Tyr Cys Phe Asp Gin Leu Arg Arg Arg 
50 55 * 60 

TTC GGG GAC GTG TTC AGC CTG CAG CTG GCC TGG ACG CCG GTG GTC GTG 24 0 

Phe Gly Asp Val Phe Ser Leu Gin Leu Ala Trp Thr Pro Val Val Val 
65 70 75 30 

50 



35 



40 



55 
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CTC AAT GGG 
Leu Asn Gly 



CTG 
Leu 



GCG GCC 
Ala Ala 
85 



GTG CGC 
Val Arg 



GAG 
Glu 



GCG 
Ala 
90 



CTG 
Leu 



GTG 
Val 



ACC 
Thr 



CAC GGC GAG 
His Gly Glu 
95 



288 



GAC ACC GCC 
Asp Thr Ala 



GAC 
Asp 
100 



CGC CCG 
Arg Pro 



CCT GTG 
Pro Val 



CCC 
Pro 
105 



ATC 
lie 



ACC 
Thr 



CAG 
Gin 



ATC 
lie 



CTG GGT TTC 
Leu Gly Phe 
110 



336 



70 



GGG CCG 
Gly Pro 



CGC GAG 
Arg Glu 
130 



CGT 
Arc 
115 

CAG 
Gin 



TCC 
Ser 



AGG 
Arg 



CAA GGG 
Gin Gly 



CGC TTC 
Arg Phe 



GTG 
Val 



TCC 
Ser 
135 



TTC 
Phe 
120 

GTC 
Val 



CTG 
Leu 



TCC 
Ser 



GCG 
Ala 



ACC 
Thr 



CGC 
Arg 



TTG 
Leu 



TAT 
Tyr 



CGC 
Arg 
140 



GGG 
Gly 
125 

AAC 
Asn 



CCC GCG TGG 
Pro Ala Trp 



TTG GGC CTG 
Leu Gly Leu 



384 



432 



75 



GGC AAG 
Gly Lys 
145 



AAG 
Lys 



TCG 
Ser 



CTG GAG 
Leu Glu 
150 



CAG 
Gin 



TGG 
Trp 



GTG 
Val 



ACC 
Thr 



GAG 
Glu 
155 



GAG 
Glu 



GCC 
Ala 



GCC TGC CTT 
Ala Cys Leu 
160 



480 



20 



TGT GCC 
Cys Ala 



GCC 
Ala 



CTC TTG GAC 
Leu Leu Aso 



TTC 
Phe 



AAA 
Lys 
130 



GCC AAC 
Ala Asn 
[165 

GCC GTG 
Ala: Val 



CAC TCC 
His Ser 



AGC AAC 
Ser Asn 



GGA 
Gly 



GTG 
Val 
185 



CGC 
Arg 
170 

ATC 
lie 



CCC 
Pro 



GCC 
Ala 



TTT 
Phe 



TCC 
Ser 



CGC 
Arg 



CTC 
Leu 



CCC AAC GGT 
Pro Asn Gly 
175 

ACC TGC GGG 
Thr Cys Gly 
190 



528 



576 



25 



CGC 
Arg 



CGC TTC 
Arg Phe 
195 



GAA 
Glu 



TAC GAC 
Tyr Asp 



GAC CCT 
Asp Pro 
200 



CGC 
Arg 



TTC 
Phe 



CTC 
Leu 



AGG 
Arg 



CTG 
Leu 
205 



CTG GAC CTA 
Leu Asp Leu 



624 



30 



GCT 
Ala 



AAT 
Asn 
225 



CAG GAG 
Gin Glu 
210 

GCT GTC 
Ala Val 



GGA 
Gly 



CTG AAG 
Leu Lys 



CCC 
Pro 



GTC CTC 
Val Leu 
230 



GAG GAG 
Glu Glu 
215 

CTG .CAT 
Leu His 



TCG 
Ser 



ATC 
He 



GGC 
Gly 



CCA 
Pro 



TTT 
Phe 



GCG 
Ala 
235 



CTG 
Leu 
220 

CTG 
Leu 



CGC 
Arg 



GAG GTG CTG 
Glu Val Leu 



GCT GGC 
Ala Gly 



AAG GTC 
Lys Val 
240 



672 



720 



35 



CTA 
Leu 



CGC TTC 
Arg Phe 



CAA 
Gin 



AAG GCT 
Lys Ala 
245 



TTC CTG 
Phe Leu 



ACC 
Thr 



CAG 
Gin 
250 



CTG 
Leu 



GAT 
Asp 



GAG CTG 
Glu Leu 



CTA ACT 
Leu Thr 
255 



768 



40 



GAG CAC AGG 
Glu His A.rg 



GAG GCC TTC 
Glu Ala Phe 
275 



ATG 
Me t 
260 

CTG 
Leu 



ACC TGG 
Thr Trp 



GCA GAG 
Ala Glu 



GAC CCA 
Asp Pro 



ATG GAG 
Met Glu 
280 



GCC 
Ala 
265 

AAG 
Lys 



CAG 
Gin 



GCC 
Ala 



CCC 
Pro 



AAG 
Lys 



CCC 
Pro 



GGG 
Gly 



CGA GAC 
Arg Asp 
270 

AAC CCT 
Asn Pro 
285 



CTG ACT 
Leu Thr 



GAG AGC 
Glu Ssr 



816 



864 



45 



AGC TTC AAT 
Ser Phe Asn 
290 



GAT 
Aso 



GAG AAC 
Glu Asn 



CTG TGC 
Leu Cys 
295 



ATA 
He 



GTG 
Val 



GTG 
Val 



GCT 
Ala 
300 



GAC CTG 
Asp Leu 



TTC TC" 
Phe Ser 



912 



50 



55 



96 
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GCC GGG ATG GTG ACC ACC TCG ACC ACG CTG GCC TGG GGC CTC CTG CTC 960 
Ala Gly Met Val Thr Thr Ser Thr Thr Leu Ala Trp Gly Leu Leu Leu 
305 310 315 320 

ATG ATC CTA CAT CCG GAT GTG CAG CGC CGT GTC CAA CAG GAG ATC GAC 1008 
Met lie Leu Kis Pro Asp Val Gin Arg Arg Val Gin Gin Glu lie Asa 
325 % 330 335 

GAC GTG ATA GGG CAG GTG CGG CGA CCA GAG ATG GGT GAC CAG GCT CAC 1056 
Asp Val He Gly Gin Val Arg Arg Pro Glu Met Gly Asp Gin Ala His 
340 345 350 

ATG CCC TAC ACC ACT GCC GTG ATT CAT GAG GTG CAG CGC TTT GGG GAC 1104 
Met Pro Tyr Thr Thr Ala Val He His Glu Val Gin Arg Phe Gly Aso 
355 360 365 

ATC GTC CCC CTG GGT GTG ACC CAT ATG ACA TCC CGT GAC ATC GAA GTA 1152 
He Val Pro Leu Gly Val Thr His Met Thr Ser Arg Asp He Glu Val 
370 375 380 

CAG GGC TTC CGC ATC CCT AAG GGA ACG ACA CTC ATC ACC AAC CTG TCA 1200 
Gin Gly Phe Arg ; He Pro Lys Gly Thr Thr Leu He Thr Asn Leu Ser 
385 390 395 400 

TCG GTG CTG AAG GAT GAG GCC GTC TGG GAG AAG CCC TTC CGC TTC CAC 124 8 

Ser Val Leu Lys Asp Glu Ala Val Trp Glu Lys Pro Phe Arg Phe His 
405 410 415 

CCC GAA CAC TTC CTG GAT GCC CAG GGC CAC TTT GTG AAG CCG GAG GCC 1296 
25 Pro Glu His Phe Leu Asp Ala Gin Gly His Phe Val Lys Pro Glu Ala 
420 425 430 

TTC CTG CCT TTC TCA GCA GGC CGC CGT GCA TGC CTC GGG GAG CCC CTG v 1344 
Phe Leu Pro Phe Ser Ala Gly Arg Arg Ala Cys Leu Gly Glu Pro Leu 
435 440 • 445 



15 



20 



30 



35 



GCC CGC ATG GAG CTC TTC CTC TTC TTC ACC TCC CTG CTG CAG CAC TTC 13 92 

Ala Arg Met Glu Leu Phe Leu Phe Phe Thr Ser Leu Leu Gin His Phe 
450 455 460 

AGC TTC TCG GTG CCC ACT GGA CAG CCC CGG CCC AGC CAC CAT GGT GTC 144 0 

Ser Phe Ser Val Pro Thr Gly Gin Pro Arg Pro Ser His His Gly Val 
465 470 475 480 

TTT GCT TTC CTG GTG AGC CCA TCC CCC TAT GAG CTT TGT GCT GTG CCC 148 8 

Phe Ala Phe Leu Val Ser Pro Ser Pro Tyr Glu Leu Cys Ala Val Pro 
485 490 495 

CGC TAG 14 94 

Arg 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 497 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 
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15 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Gly Leu Glu Ala Leu Val Pro Leu Ala Val lie Val Ala lie Phe 
1 " 5 10 15 

Leu Leu Leu Val Asp Leu Met His Arg Arg Gin Arg Trp Ala Ala Arg 
20 25 30 

Tyr Pro Pro Gly Pro Leu Pro Leu Pro Gly Leu Gly Asn Leu Leu His 
35 40 45 

Val Asp Phe Gin Asn Thr Pro Tyr Cys Phe Asp Gin Leu Arg Arg Arg 
50 55 60 

Phe Gly Asp Val Phe Ser Leu Gin Leu Ala Trp Thr Pro Val Val Val 
65 70 75 80 

Leu Asn Gly I*eu Ala Ala Val Arg Glu Ala Leu Val Thr His Gly Glu 
85 90 95 

Asp Thr Ala Asp Arg Pro Pro Val Pro He Thr Gin He Leu Gly Phe 
100 105 110 

Gly Pro Arg Ser Gin Gly Val Phe Leu Ala Arg Tyr Gly Pro Ala Trp 
25 120 125 

Arg Glu Gin Arg Arg Phe Ser Val Ser Thr Leu Arg Asn Leu Gly. Leu 
130 135 140 

Gly Lys Lys Ser Leu Glu Gin Trp Val Thr Glu Glu Ala Ala Cys Leu 
30 145 150 155 160 

Cys Ala Ala Phe Ala Asn His Ser Gly Arg Pro Phe Arg Pro Asn Gly 
165 170 175 



20 



35 



Leu Leu Asp Lys Ala Val Ser Asn Val He Ala Ser Leu Thr Cys Gly 
180 185 190 

Arg Arg Phe Glu Tyr Asp Asp Pro Arg Phe Leu Arg Leu Leu Asp Leu 
195 200 205 

40 Ala Gin Glu Gly Leu Lys Glu Glu Ser Gly Phe Leu Arg Glu Val Leu 

210 215 220 

Asn Ala Val Pro Val Leu Leu His He Pro Ala Leu Ala Gly Lys Val 
225 230 235 240 



45 



50 



Leu Arg Phe Gin Lys Ala Phe Leu Thr Gin Leu Asp Glu Leu Leu Thr 

245 250 255 

Glu His Arg Met Thr Trp Asp Pro Ala Gin Pro Pro Arg Asp Leu Thr 

260 * 265 " 270 

Glu Ala Phe Leu Ala Glu Met Glu Lys Ala Lys Gly Asn Pro Glu Ser 

275 280 ^ ' 285 
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Ser Phe Asn Asp Glu Asn Leu Cys lie Val Val Ala Asp Leu Phe Ser 
290 295 300 

Ala Gly Met Val Thr Thr Ser Thr Thr Leu Ala Trp Gly Leu Leu Leu 
305 310 315 320 

Met He Leu His Pro^Asp Val Gin Arg Arg Val Gin Gin Glu He Asp 
325 330 335 

Asp Val He Gly Gin Val Arg Arg Pro Glu Met Gly Asp Gin Ala His 
340 345 350 

Met Pro Tyr Thr Thr Ala Val He His Glu Val Gin Arg Phe Gly Asp 
355 360 365 

He Val Pro Leu Gly Val Thr Kis Met Thr Ser Arg Asp He Glu Val 
370 375 380 

Gin Gly Phe Arg He Pro Lys Gly Thr Thr Leu He Thr Asn Leu Ser 
385 390 395 400 

Ser Val Leu Lys.. Asp Glu Ala Val Trp Glu Lys Pro Phe Arg Phe His 
'405 410 415 

Pro Glu Kis Phe Leu Asp Ala Gin Gly His Phe Val Lys Pro Glu Ala 
420 425 430 

Phe Leu Pro Phe Ser Ala Gly Arg Arg Ala Cys Leu Gly Glu Pro Leu 
435 440 445 

Ala Arg Met Glu Leu Phe Leu Phe Phe Thr Ser Leu Leu Gin His Phe 
450 455 460 

Ser Phe Ser Val Pro Thr Gly Gin Pro Arg Pro Ser His His Gly Val 
465 470 \ 475 480 

Phe Ala Phe Leu Val Ser Pro Ser Pro Tyr Glu Leu Cys Ala Val Pro 
485 490 495 

Arg 

(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



45 



(xi) SEQUENCE 4 DESCRIPTION : SEQ ID NO: 39: 
GGAACGCATG GTGGTGCTGC ATGGATATGA AGTG 34 



50 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

70 

CTCAAAGATC TATGGCCCTG TGTTCACTCT GTATTTTGGC CTCGAGCGCA TGGTGG 56 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
75 (A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCACCATGCG CTCGAGGCCA AAATACAG 2 8 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 'C 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GGGTTCCCGG GAAATAATCA ATGATAGTGG G '31 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGT+k 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
45 GGATTGTAAG CACCCCCTGG ATCCAGATAT GC 3 2 
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<2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
.CCCAGCTCCA AGTAAGTCAG CTGCAGTGAT TACC 34 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGTGGTACCC TTGGGAATGA GGTAGTTTCT GAATTTAACG TC 4 2 

£2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 46: 
AGTCTAGAAT GGATCCTTTT GTGGTCCTTG TGC 33 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CCCAGAGCTC TGTCTCCAGA GTGAAAGGAG 3 0 
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(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND&DNESS : single 

(D) TOPOLOGY: linear 



10 (xi| SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

ACAGAGCTCT GGGAGAGGAA AACTCCCTCC 30 
(2) INFORMATION FOR SEQ ID NO: 49: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CCATAGATTT TTGAGAGATT GGTTAAGGAT TTGCTGACAT CCTTAATATC TATC 54 
25 (2) INFORMATION FOR SEQ ID NO: 50: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(i) 

30 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 50: 
GACCCTCGTC ACTTTCTGGA TGAAGGTGGA 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GAAGTAGTTA CTTTTCTTAA AATTTCCACC TTCATC 3 6 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
AAAGAATTCC CCAACCCAGA GATGTTTGAC CCTCGTC 37 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGCCAGGCCC TCTCCCACAC AAATCCGTTT TCCTGCTGAG AAAGGCATGA AGTAGTTAC 59 
(2) INFORMATION FOR SEQ ID NO: 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GAGAGGGCCT GGCCCGCATG GAGCTGTTTT TATTCCTGAC CTTC 4 4 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CAGGAGTTGT GTCAAGGTCC TTTGGGTCAA TCAG 34 
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(2) INFORMATION* FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGy: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TTGTCAATGG ATTTGCTTCT GTCCCGCCCT TCTATCAGCT GTGCTTCATT CCTGTCTGAG 60 
GATC 64 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOCOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CAGAAGCAAA TCCATTGACA ACAGGAGTTG TGTCAAGGTC CTTTGGGTCA ATCAG 55 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 60 base pairs '< 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
35 CTCAGACAGG AATGAAGCAC AGCTGATAGA AGGGCGGGAC AGAAGCAAAT CCATTGACAA 60 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GCAGCCAGAC CATCTGTGCT TCTTCAGACA GG 32 
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(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 base pairs 
5 (3) TYPE: nucleic acid 

(C) STRANDEDN2SS : single 

(D) TOPOLOGY : linear 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

CACCATATTA ACTTCCCTCA CTTCTGTGCT ACATGACAAC AAAG 44 

(2) INFORMATION FOR SEQ ID NO: 61: 

75 (i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
AATTCTTTGT TGTCATGTAG CACAGAAGTG AGGGAAGTTA ATATGGTGGT AC 52 

25 

Claims 



1. A method for evaluation of the safety of a chemical compound, which comprises the steps of: 

so (a) reacting a chemical compound with recombinant yeast cells producing human cytochrome P450 

molecular species P450 1A2, P450 2C9, P450 2E1 and P450 3A4 together with a yeast NADPH- 
P450 reductase, which may be in the form of a fused enzyme with each of said human cytochrome 
P450 molecular species, or with the cell free extracts of the yeast cells; and 
(b) analyzing the resulting metabolite to determine the safety of the compound, 

35 

2. The method according to claim 1, wherein the recombinant yeast cells are yeast cells transformed with 
plasmids having a gene coding for human cytochrome P450 1A2, P450 2C9, P450 2E1 or P450 3A4 
together with a gene coding for yeast NADPH-P450 reductase. 

40 3. The method according to claim 1 or 2, wherein the recombinant yeast ceils are yeast cells transformed 
with plasmids each of which has a fused gene comprising a gene coding for the human cytochrome 
P450 molecule on the 5'-terminal and a gene coding for the yeast NADPH-P450 reductase on 3'- 
terminal. 



45 4. The method according to any one of claims 1 to 3, wherein the analyzing of the metabolite is carried 
out by the Ames Test. 

5. The method according to claim 4, wherein the test is carried out using His" Salmonella or Trp" 
Escherichia coii. 

50 

6. The method according to any one of claims 1 to 5, wherein the recombinant yeast cells further produce 
at least one additional human cytochrome P450 molecular species selected from a group of human 
cytochrome P450 2A6, P450 2C19 and P450 2D6. 

55 7. The method according to any one of claims 1 to 6, wherein the recombinant yeast cells further produce 
at least one additional human cytochrome P450 molecular species selected from a group of human 
cytochrome P450 1A1, P450 2B6, P450 2C8 and P450 2C18. 
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. An artificial fused enzyme, which comprises human cytochrome p450 3A4 and yeast NADPH-P450 
reductase. 

. A yeast expression plasmid having a fused gene comprising a gene coding for human P450 3A4 and a 
gene coding for the yeast NADPH-P450 reductase. 

0. A method of determining in vitro the human metabolite of a chemical compound, which comprises the 
steps of: 

(a) reacting a chemical compound with recombinant yeast cells producing human cytochrome P450 
molecular species P450 1A2, P450 2C9, P450 2E1 and P450 3A4 together with a yeast NADPH- 
P450 reductase, which may be in the form of a fused enzyme with each of said human cytochrome 
P450 molecular species, or with the cell free extracts of the yeast cells; and 

(b) identifying the resulting metabolite. 



106 



EP 0 644 267 A2 



0*4J 

c c 

M-l CP 
rfl 
U 

aiM 
o : 

0) ^ 

£<=> 



CP 


CP 


C 4J 


c -u 


H C 


*H C 


>i 0) 




^ e 


u-i E 




CP 




<— < (13 


a u 




E U-I 
















S~ 






o . 


U-l o 






cn 


0) ^ 


0) ^ 


*s 


c ^ 
c o 






u ^ 









CP 


CP 


C ±J 


C -U 


H C 


•H C 


>i OJ 


>. o 


£ 


iu E 


•H CP 


-H 01 




fO 


a u 


a u 


E4-I 


E 4-1 








JO 


u£ 




0 v> 


Oo 


m • 


lM J 


o 












E ^ 




-H ° 













I _ 

O CO 

o 

»~* I — ' 

-< o 

o -< 

CD -< 

f- -< 

C_> f— 

(—* CJ 

CJ> CD 

c_> o 

e> i — • 

CD I— 

*— 

c_> -c 

o o 

C_> H- 

O t_> 

(— C-> 

o c_> 

CD CD 

-< o 

o -< 

-< c_> 

<_> -< 

-c *— 

C_> f— 







rO 




1— 


1 


1 




cj 


<_> 


C_D 




CP 




-< 


r-> 


t— 


-< 




1 


CD 


— C 




C_) 


h- 






1— ■ 


t— 


o 


I — • 


»— 


CJ 


1— 




<_> 


CJ 




C3 


1— 1 


o 




I— 


o 




1 — • 


1 — • 


C7 


CD 




c_> 


f— 


1— 


i— 


-< 


CJ> 


CD 


C3 




1— > 


f— 


-< 




-< 


1— 


O 


1— 


o 


-< 






o 


»— 


f— 


-< 


CD 


c_> 


tj> 


O 


CD 


1— 


-< 


CJ3 


-< 


»— 


-c 


1 — ' 


CJ 


-<: 


c_> 


O 


o 


cd 


1 — ' 


-c 


CD 


CD 


CD 






f— 


-c 


— < 


CJ 


-< 


-c 




CD 


-C 




-< 


-< 


CD 


-< 


— <: 


c_> 


—c 


—c 




CD 


1— 


— c 




-C 


t_> 




i — • 


CD 


I— 




t— 


t— 




1— 


t_> 


CJ 


-< 

t 


o 
1 


t 


-< 
1 




U-3 


in 







o 






<-> 


t 


i 

CD 


**> 


1 




1 


CD 




CD 


CD 


♦— 




-< 


CD 


CD 


1— 


CD 


-< 


-< 


CD 


-< 


CJ 


CD 


♦— 


CJ 


»— 


CD 


o 


CJ 


I— 


C_> 


CJ 


1— 


CD 


♦— 


CJ 


-< 


CD 


O 


CD 


CD 


CD 


CJ 




CD 


CD 






CJ 


CD 


C3 


-c 


CJ 


CD 


*— 


o 


CD 


CJ 


t_> 




-c 


CD 


1 — ■ 


CJ 


CD 


1— 


CD 


»— 


-< 




1— 


CJ 


CD 


CJ 


-< 


CD 


J— 


CD 


~< 


CD 


— < 


1— 


o 


CJ 


— < 


»— 


V— 


CJ 


CJ 


»— 


f— 


1— 


CD 


CD 




— <; 


CD 


»— 


-< 


CD 


CD 


»— 


CD 


CD 


-< 


CJ 


-< 


> 


CJ 


— : 


o 


CJ 


-< 


-< 


CJ> 


Y-~ 


-c 


-< 


o 


CJ 


-< 


— <: 


CJ 


CJ 

i 


CD 
t 


CD 
1 






vo 


to 



Fig. 1 



1 07 



EP 0 644 267 A2 



CP 


CP 


C 4J 


C 4J 


•-1 c 


•H C 


>i o 


>i 0) 


^ E 


u-i c 


•«-^ CP 


•H CP 


— i 


—« to 


a m 


a u 


Eu-* 










JO 


M w 




0 *° 


O 01 












<D •< 




*S 












D-i eu 



CP JJ 

c c 

H CU 

>i e 

4-1 CP 
■H ro 
U 

a,u_i 

c 

03 -° 
U ° 

o * 



CU cu 



CP-U 


CpxJ 


CP4J 


c c 


c c 


c c 


•h a 


•H CU 


•H CU 


>• E 




>iE 


u_4 CP 


U-I CP 


CP 


•H (0 


•H 01 


*H (TI 


— « u 








a u-i 


am 


E 


E 


E 






r0 -jo 


















o * 

U-I ° 




# 10 




cu ^ 
So 


ajcv, 

E o 


E~ 


•H LO 


•H LO 








M^r 




CXt eu 


CU cu 



» •— 

y— o 

u> i— 

t_> 

t— cp 

CJ «— ' 

Xjy >— ' 

CD t_> 

»— CJ 

CJ — < 



tD 
O -< 
(_> CJ? 
*— 



<J3 <LD 

-c -< 
— < -< 



-< •< 

C_> CD 

CD CD 

I — ■ f— 

CD t— 

f— CJ 

-C I— 

-< cj 

CO CJ> 

CJ t— 

cj cj 

f— CD 

t— -< 

*— CD 

t— C_> 

CJ f— • 

CD CD 

-< CD 

-< f— 

-< CJ 

CD -< 

-< O 

I— I— • 

c_> -< 

I — • 

CD CD 

CD CJ 



f 



I 







CD 








CO 


n 


1 








1 


i 


CD 




CO 




CD 


CD 


t— 


1 


1 


1 


CD 


CD 


h- 


C_> 


CD 


CD 


t— 


t— 


CD 




CD 




1— 


CD 


CD 




-< 


CJ 


-< 


CD 


*— 




CD 


t-» 


CD 


-< 


— C 


CD 


CJ 


CJ 


J— 


CD 


CD 


-< 


f— 


CD 


CD 


CD 


CD 


-< 


CD 


CJ 


-< 


CD 


CD 


CD 


CD 


1— 


CD 


CD 


-< 


CD 


-< 


CJ 


-< 


-< 


CD 


C— 




CD 


CD 


CD 




CD 


CD 


CJ 


-C 


-< 


CD 


-< 


fr- 


CD 


CD 


-< 


C_> 


-C 


CD 


-< 


CD 


f— 


CD 


CD 


CD 


CJ 


1— 


CD 


CD 


1— 


-< 


— c 


CD 


CD 


1— 


CD 


CD 


CJ 


CD 


-< 


CD 


CD 




CD 


-< 


CD 


CD 


"< 




CD 


CD 


CD 


I — • 


CD 


CD 


CJ 


CD 




-C 


f— 


-< 


-< 


-< 


-< 


-< 


f— 


CJ 


(— 


CD 


CD 


CD 


CD 


o 


CD 


~< 


t— 


-< 


CD 


1— 


CD 


-< 


CD 




-< 


f— ■ 


1— 


CD 


I — • 


CD 


CD 


CJ 


CJ 


CD 


CD 




CD 


1— 


CJ 


(— 


CD 


1 — ' 


f— 


-< 


— C 


-< 


CD 


CJ 


CD 


-< 


CD 


CD 
> 


CD 


CD 
t 


CJ 
( 


CD 
1 


-< 
1 




UD 


LO 


LO 


UO 





Fig. 2 



108 



EP 0 644 267 A2 



m 
u-i 



a 

03 -° 



*"* CO 

e <=> 

•'-1 kO 

a. 



c c 

>iE 
-<-* ITJ 

E 

u « 

0 J 

■ 

e <=> 

01 Ol. 



Cn c 

C CJ 
•H E 

>i CP 
U-l 03 

a 

B 

rO ^ 
cn 

GO 

* o 
oj ^ 

Ho 

■H vo 
H -r- 



4J 

c a> 

•h B 

4-1 aj 

a 

M * 



On 
E o 
wo 

0l»cu 







CO 












1 


ro 




CO 






CJ 


1 




1 






cj 


CJ 




CJ 


CO 


CO 


f— 


1— 


CO 


CD 


1 


l 


CD 


t-» 


1 


"< 


1— 


CD 


CD 


-< 


CD 


-< 


cj 


*— 


t— 


c_> 


CD 


CJ 


CJ 


( — < 


CD 


CD 


I— 


1— t 


1— 


I— 


f— 


-< 


CJ 




cj 


O 


I— 




1— 


1— 


o 


o 


f— 


o 


o 


CJ 


' 


f-~ 


1— 


CD 


CD 


CD 


CD 


-< 


CJ 


CD 


' CD 


CD 


o 


-< 


cj 


CD 


*— 


K— 


CD 


o 


-<: 


e-» 


CD 


CD 


-< 


CD 


~< 


*■< 


t— 


-< 


CJ> 


I— « 


CD 


CD 


C_> 


CJ 


(— 


CJ 


CD 


-< 


CD 


-< 


cj 


CJ> 


1— 


O 


-< 


1— ' 


-< 


C_) 


-< 


CJ> 


CJ 


1 — 


-< 


-< 


-< 


CD 


CJ 


CD 


CD 


CD 


CD 


-< 




-< 


CD 


*— 


-< 


CJ 


1 — 


CJ 


t— ' 


— < 


f— 


CD 


CD 


1— ■ 


-< 


-< 


CJ 


CD 


CD 


-< 


-c 


CD 


1— 


CJ 


*— 


f — ■ 


-<: 


(— 


CD 


1— 


~< 




-< 


CJ 


-< 


-< 


-< 


CJ 


CD 


CJ 


-< 


CD 


CD 


-< 


-< 




CD 


-< 


-< 


-c 


f— 


— < 


-< 


CJ 




-< 


CJ 


CD 


CD 


CD 


CJ 


CJ 


f— 


CD 


-C 


-< 


1— 


CJ 


CJ 


CD 


-< 


f-» 


CD 


CJ 


CJ 
1 


CD 
1 


CD 
1 


-< 


-< 
1 


CJ 

1 


CO 




to 


LO 


W3 


UD 










CO 




CO 




CO 








ad 




CJ 




CJ 








CvO 









ro 
I 



CD CJ 
-< CJ 

I — • CD 

I— h- 

CD CJ 

-C I— 

CJ -< 

CD t— 

I- CJ 

CJ -C 

CJ -< 

-< 

-< CJ 

CJ 

-< CD 

-< -< 

CJ 

CD CD 

CD -< 

t— CD 

f~* CD 

-< -< 

' CJ 

f— -< 

-< CJ 

CD CD 

CJ -< 

CJ CJ 

CJ -< 

I I 



Fig. 3 



109 



* EP 0 644 267 A2 



O 

CO 



.a 

u 
u 
cn 



CP 


CP 


V 


C JJ 


CJJ 


c 


<H C 


•H C 


a> 


>i 0) 


>iQJ 


ens 


*m e 


m e 


C CP 


ZP 


— 1 CP 


■<-t ro 




— < ra 




aM 


aM 








• H 






<-H •— . 






Cu o 








o - 


0 w» 




4-1 ' 


u-r • 




o 


o 




^to 


U «o 


o ^ 


a o 


a> <=> 




E <~ 






—i ° 
W~ 


■H ° 


u -«* 

c o 







i 



i 

in 



.Q 



5 

O • 
sz o 

(U O 

a -u 
c 

QJ O 

3 CM 

cr 

tn o 
x: a 







**> 
1 


o 
1 






o 


CD 




rO 


1- 


-< 




1 


o 


o 


1 


e> 


CJ 


-c 


o 


u 


♦— 


c_> 


-< 


CD. 


1— 


cd 


o 


•< 


o 


CD 




-< 


cd 


CD 


1— 


tD 


o 




«_> 


o 


1— 


CJ 


o 


\— 


o 


CD 


C3 


1— 


c_> 


-< 




-c 




t- 


»- 


1— 


-< 


CJ 








t— 


CD 






1— 


-< 


1— 


-< 


-< 


O 


CD 


C3 


cj 




(_> 


o 




I— 


1— 


-< 


o 


o 


o 


1— 


o 




LD 


-c 


-< 


C3 


CD 


-< 


»— 


-< 


-< 


CD 


CD 


o 




o 


CD 


*— 




1— 


-< 


»— 




»— 


V— 


C3 


-< 




o 


1— 
1 


-< 
1 


1 


-c 
1 










VO 


w> 







- -< 

CO CJ) 



0) 
E 
>i 

N 
C 
G) 

tJ 
<U 
to 
a 



^ CD CJ 



-P 
U 



cj» c_> 
O -< 



CJ> C_3 

o o 

CD CD 



C_> -< 
I— O 
-< CD 



Fig. 4 



110 



EP 0 644 267 A2 



CJ 

CP (_» 
-< I— 



CP x-> 

cp c_» 

C5 <_) 

CP «J 

CP CP 



CD LP 

-< 

O CP 

-< I— 

<_> t3 

C_> CD 

CP C_> 

CP O 

O CP 

t— -c 



< 



CP -c i— 

C cj cd 

CP CP 

"H CP CP 

— »— -C 

~* CP CP 

0 ■ e 



U 

O 

M 

QJ 



Q 

Cvl 

C 
•«~f 

c 
o 



u 
o 

M 
C 



CD CP 

I— -c 

CD CJ 

CP CD 

CP CP 

CP CJ 

o cj 

f— -c 

CP CP 

CP cp 

CP (J 

CD CP 

CD c_> 

»— -c 

CD C_> 

CD O 

I— -< 

CJ CP 

-< I — • 

CJ CD 

CP CJ 

-< »— 

-< I— 

CP CJ 



CJ CP 
CP CJ 



CP CP 
CP CJ 



* — ■ -< 

CJ CP 

O CP 

-< 

-< t— 

CP CP 

CP C-> 

CP CJ 

CP CJ 

1— -< 

CP CP 

CP CP 

CD CD 



Fig. 5 



111 



EP 0 644 267 A2 



Sc 



](1.5kb) 



pUCUB/Sac!, Hindi 



ASH-P ADH-T 



H 



H N 



p AAH5N 
YR ADH-P AOH-T 



H N 



pAHRR 



Sad 

Hlndlll 
Fill In 



linker(40bp) 
H Sc 
■ 

I pUCHS /Hlndlll, Sad 

T 




Hlndlll 



Hindll 



Sad, EcoRI 



H Sc 



1 



T 

H H 



EcoRI 
Fill In 

Hind III linker 
Hlndlll 



H 

2JC 



3a. 



p1A2 



p1A2R 



Fig. 6 



112 



EP 0 644 267 A2 



p2C9 



p2C9R 



](0.9 kb) 



Pstl 

pUCB /Hindi, Pstl 

X P H K 




H X X H 



| Noil 

N X X M 



N X X N 

"mw//////M m 



ADH-P AOH-T 



Noll 



H - . H H 
pAAHSN 

YR ADH-P ADH-T 



Notl 



H H N 

pAHRR 



Fig . 7 



113 



EP 0 644 267 A2 



I I <0.5kb) 
— B 



pUC118/EcoRI, BamHI 




AOH-F AOH-T 



H H H 

pAAHSN 



Hlndill 



T 

H 



Hlncilll 



DSL 



p2E1 



VR ADH-P A DH-T H H 

15 p2E1 R 



n H n Hindlll 
pAHRR 



Fig . 8 



114 



x EP 0 644 267 A2 



Sc 



pUCl18/Sacl, Hindi 



AOH't> AOH-T 



N H H 

pUCAN 

HIndlll 
Fill In 
Xbal linker 

ADH-P AOH-T 



N X H 

pUCANX 



Xbal 



(0.6kb) 
Sad 



E Sc H 

i r— 1 1 




> 


f Sphl 




^ Blunting 




Xbal linker 




Xbal 



p3A4 



p3A4R 



H X X N 



3SL 



Noll 

Y 

H X X H 

□a. 



H X X N 

1 y////////A geg 



Noll 



ADH-P AOH-T 



Noll H H N 

pAAHSN 



ADH-P AOH-T 



H H N 

pAHRR 



Fig . 9 



115 



EP 0 644 267 A2 




pUCl9/Htncll 



AOH-P A DH-T 

N H N 
pAAHSN 

AOH-P AOH-T 



Hindlli 



N H N Htndll! 
pAHRR 



HIndlll 



T 

H 



\W//M~ 



p1A1 



S_ p1A1R 



Fig. 10 



116 



EP 0 644 267 A2 




ADH-P AOH-T 



JHH»S_ — >- 1^-^! W////M B5 p2A6R 

H UN Hindlll 
pAHRR 



Fig. 11 



117 



EP 0 644 267 A2 




wmm_ 

H H N 
pAAHSN 



Hlndfll 



j Hlndtll Partial 



p2B6 




Fig. 12 



110 



EP 0 644 267 A2 



AOH-P ADK-T 

mzm 

N H N 
pAAHSN 




Hindlll 



Hindlll 



DBL 



p2C8 




H N Hindlll 



H H 



IH_ p2C8R 



Fig. 13 



119 



EP 0 644 267 A2 



{1.4 kb) 



pUCA /Hindi, Pstl 




N X X H 



Noll 



p2C18 



H X X H 



Not! 



ADH-P AOH-T 



H H N 

pAAHSN 



P2C18R \:,**&rJiv\ 



N X X U 



Noll 



YR ADH-P ADH 'T 

H H 



H 

pAHRR 



Pig. 14 



120 



x EP 0 644 267 A2 




121 



x EP 0 644 267 A2 



linker 

(40bp) (906p) 
X B» Ba SmP 



Bli>« Script {*)/Xb«f, Ptu 




___J{0.9kb) 

Kpntj K 

^pUCA/Kpnl.HlnclI 

HXNv KH 

Ml 11 

NtpY, Xb«] 



M H H 

pAAHSN 



HindlM 



p2D6 



H H N Hindlll 
pAHRR 



p2D5R 



Fig. 16 



122 



EP 0 644 267 A2 



p3A4 
pen 



X Dq 



pBFCRI 

Xhol, Hindi!! 



Xbol, Xhof 



Xho 



Xba 



h-Blu» Scrlpt(+) - Xhol, Hlndlll 
Xho T HXb. 



Xhol, Xbai 



Xho 



Xbi 



Xho 



T 

Xho 



Nott 



pUCAN - XbaJ 



Xb* u 

J_L 



pAAHSN - NoU LarflQ fragment 
pF3A4 



Xba N 



Fig. 17 



123 



EP 0 644 267 A2 



pUC19 

EcoRI 



pAAHS 
Xhol 



pAARN 
Notl linkar | Hlndlil 

HP h 




Fill In 
Xbat llnkaf 
Xbal, Sail 



PS X 

mm. 



Xbal, PstI 
X 



pUC19 

| Xbal, Sail 



2jiDNA 



pURL 



] 

HP 


I 

P 


H 






, Hlndlil 






Fill in 






Ligation 




HP 


P 





K H HP P 

pURLG 1 
-1 Hlndlli 



PstI, Xbal 
P 



pUGAP 

I Natl 
T 

MHN 




P450 cDNA 
Hlndlil 



N h H KP 



Fig. 18 



124 



